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LABORATORY MEASUREMENT OF PARENTAL BEHAVIOR 


ROBERT E 


SCHULMAN, DONALD J. SHOEMAKER,' ann IRVIN MOELIS 


University of Illinois 


Lacking adequate internal measures of child 
personality functioning, clinicians have tradi- 
tionally turned to an attempted analysis of 
the parent-child interaction in order to un- 
derstand the etiological factors involved in 
the development of maladjustive behavior in 
children. The primary methods utilized in 
such clinical investigation have been the in- 
terview and psychological testing of the par- 
ents. While some studies have suggested that 
a sufficiently controlled interview can yield 
predictively useful information (Fitzgerald, 
1954; Putney, 1947), the defensive tendencies 
in interviewees leading to inadequate report 
and distortion are generally recognized by 
those utilizing the method. Psychological tests 
have as yet demonstrated few consistent rela- 
tions between test scores of the parents and 
the occurrence of problems in their children. 

An obvious alternative to the reliance on 
questionable data obtained in an interview is 
the use of a home visit. This method of ob- 
taining information regarding parent-child in- 
teraction patterns has been used by social 
workers and in more intensive psychological 
research, but is rarely seen as a routine pro- 
cedure in psychological or psychiatric clinics. 
The obvious reasons for its relative disuse are 
expense and lack of quantifiable measure- 
ment techniques. 

Another alternative, one utilized as a part 
of a larger investigation of parent-child rela- 
tions conducted by one of the present writers 
and other staff members of the University of 
Illinois Psychological Clinic, is the measure- 
ment of parent-child interaction in the clinic 
playroom situation. In utilizing this method, 
either one or both of the parents have been 
invited into the playroom with the child, with 
the instructions either to play freely with the 
child or to interact with the child on some 
predetermined task. Various measures of the 
observed interaction have been obtained, and 
attempts to relate these measures to child be- 
havior outside the playroom have been made. 


1 Now at Southern Illinois University 


Although this method is similar to that uti 
lized by Bishop (1951) and Moustakas, 
Sigel, and Schalock (1956), it differs in that 
the primary focus is on the relation existing 
between parent-child behavior in the play- 
room and child behavior in the general life 
situation. If such relations can be demon- 
strated, then the possibility arises of using 
the parent-child interaction in the clinic to 
he}p locate factors in the general parent-child 
interaction that are related to child malad- 
justment. 

The present study focused on one aspect of 
child behavior (aggression) and two aspects 
of parental behavior (frustration and model) 
that, on the basis of previous research (Leton, 
1958; Levin & Sears, 1956), were hypothe- 
sized to be related to aggression in children. 
In considering frustration, we were concerned 
with the parents’ excessive limiting or con- 
trolling of the child’s behavior with resulting 
frustration to the child. In considering the 
parents as a model for aggression, we were 
concerned with the example for aggressive be- 
havior that the parents might present to the 
child in the interaction with each other in 
the child’s presence. The hypotheses tested 
were (a) the parents of conduct problem chil- 
dren will exhibit significantly more control 
over the behavior of the child than will par- 
ents of nonconduct problem children and, 
(6), the parents of conduct problem children 
will exhibit significantly more aggression be- 
tween themselves than will 
nonconduct problem children. 


the parents of 


METHOD 
Delineation of the Variables 


On the basis of pilot study, nine behavior rating 
categories were specially constructed for use in the 
present research. Four of the categories deal with 
the aggressive model variables, five deal with the 
parental control variables. 


CONTROL VARIABLES 


pd: Parental domination—Parent dominates the 
task, and restricts the child’s behavior 
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110 R. E. ScHutman, D. J. 
pr: Parental rejection—-Parent rejects child’s be- 
havior in the task situation. 

Parent takes over (literally)—-Child stands 
aside while the adult goes ahead to do the task. 
ph: Parent hostile to child—Not physically, but is 

overcritical, forbids certain behaviors, does not 

reward child for his effort. 

ps: Parent subtle direction—Parent offers aid if 
child will do it a certain way and gives child in- 
formation which tends to bias child in parent’s 
direction. 


pt 


AGGRESSION MODEL VARIABLES 
Parents argue—Parents disagree between them- 
selves over how the task should be done. 

Dominance—One parent attempts to dominate 
and control the behavior of the other. 

Hostility between parents—One or both parents 
are aggressive towards the other, not physically, 
but through remarks and gestures 

cr: Criticism—One parent 
other parent’s behavior. 


criticizes and corrects 


These categories were scored by two trained ob 
servers in 10-second blocks of time. An automatic 
timing device flashed a light on every 10 seconds; at 
that time the observer recorded the that 
had occurred in the last 10 seconds. Behavior was 
rated by the two observers for 45 minutes or 270 
10-second blocks of time. While the parents were en- 
gaged in many activities, and since the experimenter 
was interested in relatively few behaviors, the perti- 
nent behavior did not arise in every 10-second block. 

In addition to the behavior rating categories, sev- 
eral six-point rating scales were included to give ad- 
ditional information concerning the overall effective- 
ness of the family unit, cooperation and hostility 
between the parents, rejecting behavior by the par- 
ents, a love-hostility dimension, and an autonomy- 
control dimension. The two observers rated each 
family on these scales immediately after the 45- 
minute observation period. 

The interrater reliability coefficients, for each ob- 
servation and rating scale (Tables 3 and 4 respec- 
tively), were determined by computing a Pearson r 
between the scores of the two raters for each family. 


behavior 


Subjects 


Forty-one males, 8 through 12 years old, and their 
parents served as subjects. Twenty children who were 
exhibiting conduct problems were selected upon the 
recommendation of psychologists and social workers 
who were working with them psychotherapeutically. 
As further verification of a conduct problem, the 
mother of each child filled out the Peterson Problem 
Check List which provides a measure of the conduct 
and personality problems seen in a child by others. 
The nonconduct problem children were drawn from 
the public schools by having teachers select children 
who were not behavior problems in school. Mothers 
of this group also filled out the Problem Check List. 

Table 1 presents basic information about these 


SHOEMAKER, 


Moe LIs 


rABLE 1 


able 


1 Check List 7.3 2.43 
) SD 1.56 
M 113.76 
SD 17.23 
\ge of i ? ‘ M 119 mos 
[ SD 10.49 


athers” ocx n Me 3.23 Mf 3.52 
1 D 1.66 


groups. As Table 1 indicates, the groups were not 
significantly different in measured characteristi: 
except score on the Problem Check List. 

t tests were used to test for differences between 
means. The possible range on the Peterson Problem 
Check List is 0 to 10, low scores indicating absenc« 
of a problem. The estimated IQ scores were obtained 
by prorating the three subtests (Information, Vo 
cabulary, and Block Design) as described by Wechs- 
ler (1949), while the social class ratings were ob 
tained by giving fathers’ occupations a score as 
described by Warner, Meeker, and Eells (1949). 

It was necessary to make contact with 36 families 
to obtain 21 nonconduct problem (NC) subjects. 
Thus, 58.33% of those contacted agreed to serve as 
research subjects. In the conduct problems (C) group 
only two families refused to 
search, 

In order to keep the identity 
conduct, nonconduct) _rom 


cooperate in this re- 


of a family (ie., 


the observers, the spe- 


cific experimental appointments were made by people 


affiliated with the University Psychological Clinic, 
but not by the two observers. During the experi- 
mental session and all the ratings, the two experi- 
menters were not informed about the group identity 
of any of the families 


Materials 


The parent-child interaction took place in a car- 
peted playroom equipped with one-way mirror and 
microphone. Directly in front of the mirror was a 
table on which the experimenters had constructed a 
plywood garage, school, church, 
type normally used in play 
one-story buildings with no 
could easily manipulate small dolls and doll furni- 
ture in the buildings. The doll figures consisted of 
two adult males, two adult females, a girl, a boy, 
and a baby. In addition to these materials directly 
associated with the task presented to the family, the 
room also contained numerous toys considered highly 
desirable by most children. These included a small 
wading pool with boats, a drum set, blackboard and 
chalk, doctor’s set, Lincoln logs, beads, and a small 
football and baseball. These toys 


and house of the 
therapy. These were all 
roofs, so the children 


were chosen be- 





LABORATIORY 


cause they were generally much more desirable than 
the task materials. Through the use of these desirabk 
toys and the instructions to the parents (see next 
section) which stressed the importance of keeping 
the child working with the task materials, the ex 
perimenters intended to create a situation analogous 
to the socialization process in which the parent, re 
acting to social pressure, must attempt to guide tl 

child away from activities that might } 


trinsically desirable towa: l 


“constructive” or socially a 

was hoped that the playroom situati 
even if only 
normally 
+} 


to a mild degree, int 
occurring in the home 
ie relation betwee n playroom 
behavior. Observation suggest 

rs were successful in creatis 
child as to whether to coopera 


the task or to play with the 


Playroom Task 


bal imaginative stories 
playroom. The parents wer 
tory leads) of 1 stories whi 
were to complete. In makings 
the parents’ job to keep th 
task. The task equipment could 
tk child wishec 


al of those gi 


le parents and 
Ss are typi 

outside pl 

called to him to come inside 


family is in the hou 


The boy is 


Aving 


1] Procedure 


i mation 
After this tl 
playroom 
the presence 
started ar 
the ston 
enter 
periment 
nan 


tape 


, a torn f +} 


btained from the parents th 
’ birth date, and answered questions the 
At this time, E 


who the parent 


parent 
indicated th 1e did not know 


were and th 1t to tell him 
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In this way Ez was able to tally 
still unaware of the group identification of the famil: 


the frequency count 


RESULTS 


Before considering the results relevant to 
the major hypotheses, it is necessary to point 
out that, as the intercorrelations in Table 2 
indicate, the five control variables apparent); 
were not measuring the same thing. Parental 
rejection and parental hostility appear to bi 
measuring a common variable of love-hostilit: 
ind seem to be independent of the other thre: 
While the categories of pat 
(pd) and parent takes over 
negative ly correlated. 


low positive relation 


control variables 
ent domination 
(pt) are they how a 

to an autonomy-contro! 
This 


be mutually exclusive 


rating scale (.19) that thes 


Suggests 


two variables may 


I I 


incompatible methods of parental cont 
their reaction to the situation. parent 


I 


show either pd or pt, but not both 


sidering the results 


therefore, it is 1 
to be aware that we are ] 


dealing wi 


dimensions, love-hostility and 
than 


contro] 


° - ° ° . ’ 
a singie dimer ~ n oft contro 


were ana ¢ Vy use 
median test (corrected for continuity) as dé 


scribed by 


Siegel (1956). The two raters’ fre 


7 
i 
quency scores for each 


iriable were 


ior ¢ 


rater’s 


into 


bined 
family on each variable 


a joint frequency score 


Each 


1 combination of 


hl 


for a category was actua 
both 
the family behavior each paret 
but the main i 
combined behavior pattern of the 
they related to the child in the 
situation. 


father bel 


mother ind ) lavior I 


individually, 
parents a 
standard 
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The results of the chi square analyses are 
presented in Table 3. The measures of pa- 
rental rejection and hostility were both sig- 
nificant beyond the .01 level, indicating that 
the C group parents were more hostile and 
rejecting than the NC group parents. The cor- 
relation between pr and ph was .77. 

In addition to these behavior ratings, the 
scores on the six rating scales were divided 
into high (4, 5, and 6) and low (1, 2, and 3) 
for each scale and analyzed by use of a 2 X 2 
contingency table corrected for continuity 
(Siegel, 1956). Two of these scales showed 
significant chi squares; the rejecting behavior 
continuum, and the love-hostility continuum. 
These results again indicated that the C group 


TABLE 3 


DIFFERENCE BETWEEN CONDUCT AND 
NoNCONDUCT PROBLEM FAMILIES 


Behavior category 


above 


velow 


Parent takes over 


helow 


subtle directior 


Dominance 
above 


below 


Hostility between parents 
above 11 10 
below 9 Il 
Criticism 
above 12 9 
below 5 12 


gnificant beyor OL level of c« 


R. E. ScHuLMAN, D. J. SHOEMAKER, AND I. 


MOoELIS 


TABLE 4 


Group DIFFERENCES ON RATING SCALES 


Rater 
relia- 
x? bility 


Scale 
Rejecting behavior 13.535*** 85 
high 
low 
Cooperat 
high 
low 
Hostility betwee: 
high 
low 
Overall effect 
high 
low 
Love-hostility 
high (hostilit 
low (love) 
Autono 
high (control) 
low 1utonom 


1y-contr 


** Significant 
tailed test 

*** Significant 
tailed tes 


parents were more hostile and rejecting of 
their children than the NC group parents 
These two rating scales were correlated .78. 

If no scorable activity took place during a 
10-second scoring period, a check mark was 
entered into the scoring box. This provided 
an indication of the general activity level in 
the playroom situation. 

There were no difference between the aver- 
age combined (for both raters) number of 
check marks between the two groups. The 
NC group had a mean of 182.95 with a sigma 
of 71.98 and the C group had a mean of 
181.05 with a sigma of 58.15. 


As an overall measure of reliability, the per- 
centage agreement between the two raters was 


calculated. Agreements were counted when 
both raters scored the same behavior, and in 
the same time sequence within a box. The 
average percentage agreement for the C group 
families was 74.02% with a sigma of 6.86. 
For the NC group it 76.79% with a 
sigma of 5.29 


was 


DISCUSSION 


The results of this study indicate that, in 
this situation, the parents of conduct prob- 
lem children exhibit significantly more hos- 
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tile behavior toward their children than do 
the parents of nonconduct problem children. 
[hese parents also reject their children sig- 
nificantly more than do parents of noncon- 
duct problem children. 

On the basis of these results, it is tempting 
to assume that we have shown a cause and 
effect relationship, whereby rejection and hos- 
tility on the part of the parents lead to ag- 
gressive behavior in children. The results of 
this study, however, can only indicate what 
kind of parent behavior might be expected i: 
families where there is a child who is mani 
festing some form of a conduct problem. One 
of the purposes of this work, it will be re 
called, was to make a first attempt at using 
measurement of parent-child interaction in 
the playroom situation as a diagnostic pro 
cedure. In light of this, the inability to infer 
a causal relation between parent and child be- 
havior does not lessen the value of this te 
nique as a diagnostic procedure 

Before examining the diagnostic implication 
of this procedure, it is necessary to consider 
why several of the variables were not signifi 
Considering that the task of the parents 
was to elicit the help of the child in makir 
up storie 


cant 
a wide range of behaviors was not 
available to the parents, whether the child 
was a conduct problem 

group families, one way of coping with the 
situation was to manipulate the materials, to 
make up stories without the child. and to ask 
questions of the boy whenever possible. It 
was expected, however, that the NC families 
would give the boy more freedom 


yr not. Thus, for C 


This was 
not the case, and in terms of three of the cor 
trol variables, pd, pt, and ps, the NC families 
were as controlling as the C Coinci 
1.95: 


families 
dentally, a chi square analysis (,’ 
df= 1) of the combined pd and pt scores 
was completed. This suggested that the NC 
parents may have been more controlling than 
the C parents. The probability level (.20 > p 

10) achieved, however, does not allow us 
to feel certain in regard to this relationship 
It may well be that the C parents wanted to 
control their child, but could temper this, in 
“look he ¢ 
however, control their hostile and 
Becker Hell- 


mer, Shoemaker, and Quay (1959) have dem- 


ittempting to good.” parents 


could not 


rejecting feelings. As Peterson 


PARENTAL BEHAVIOR 113 
onstrated before, parents of conduct problem 
children are themselves more maladjusted and 
more freely exhibit their hostilities. However, 
given the problem of interesting the child in 
the task, both groups of families manifested 
similar types of behavior. 

Since the variables which were significant 
were highly correlated in both the case of the 
behavior categories and the rating scales, it is 
possible that we are dealing with a single fa 
tor upon which hostility and rejection hav 
high loadings. 

Finally, both groups were equally active i: 
the playroom, yet differed significantly on two 
major variables. This suggests, of course, that 
hostility and rejection are overtones which 
enter into the general and more lusive re 
lationship which parents of conduct problem 
children have with their offspring 

It was hypothesized that conflict a1 
tilities between the parents perhaps 
a model for the conduct probler 
havior. Contrary to expectatior 
no difference between the two g 
ents on the parent interaction vari 
illy, in neither group was 
interaction 


there ver 
between the parents. Sin 
parents were engaged in the task of 

the boy interested in the job at han 

haps hostility that might have been fel 
each other was directed toward the task and 
the boy. This would 
the C 


be particularly true o 
group parents, w! 


i need 
to express hostility in an uncontrolled fashion 

This may be exactly what happens 
home. While both parents may in their ow 
interaction present a model of aggressive and 
hostile behavior for the child, when they in- 
teract with the child they direct this hostility 


o seem to have 


in the 


towards the child. In one 
serves as a scapegoat 

The most unique aspect of this research is 
the fact 
aspects of parental hostility and rejectio1 
which have been talked about and studied for 
the most part in an indirect manner have been 
observed directly. Apparently, the fact of be- 
ing observed does not inhibit the pare nts’ be 
havior to such an extent that differ 
Thi 
the po 


sense child 


that in an observed situation, those 


primary 


ences between paren roups disappear 


observational techni monstrates 
J 


sibility of relating parental 


playroom behavior 
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with child behavior outside the playroom, and 
therefore becomes potentially important 
diagnostic technique. 

At its present level of development, the 
fixed playroom diagnostic procedure is an ex- 
cellent adjunct to other diagnostic techniques, 
and may aid the diagnostician in gaining a 
sharper picture of family dynamics. 


asa 


SUMMARY 


The particular focus of interest in this 
study was on differences in behavior between 
parents of conduct problem children (C) 
and parents of nonconduct problem children 
(NC). Two predictions were made: 

1. In families in which the child can be 
considered a conduct problem, the expectation 
is that the parents will exhibit significantly 
more control over the behavior of the child 
than will parents of problem 
children. 


nonconduct 


2. In the conduct problem children’s fami 
lies, the expectation is that there will be sig- 
nificantly more aggression between the par- 
ents than there will be between 
nonconduct problem children 

Forty-one males, 8 


parents of 


through 12 years old, 
and their parents served as subjects. The par- 
ents and their child were told that they were 
to construct stories to a scene which included 
a variety of buildings and people. While the 
parents and their child performed the task, 
two experimenters rated the parents’ behavior. 

A chi square analysis of the parent behav 
ior rating scales indicated that the C group 
parents were significantly more rejecting and 
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MoE LIs 


hostile toward their 
NC group parents 
The implications of these results were dis- 
cussed as they pertain to a cause and effect 
relation between parental hostility and rejec- 
tion and aggressive behavior in children. In 
addition to this, the merits of this playroom 
technique as a 
presented 


children than were the 


diagnostic instrument were 
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CROSS-VALIDATION OF THE CONTINUOUS PERFORMANCE 
TEST FOR BRAIN DAMAGE 


JEROME D. SCHEIN! 
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But if you see an A and the next letter is an X, 
then press on the X. Ready? 


To present the AX task, a visor on the viewer 
was moved which covered the previous letters and 
exposed a second series of 36 letters. This series 
contained six X’s following A’s and was repeated 
continuously for approximately 12 minutes or 20 
revolutions. Recording of the patient’s responses 
was accomplished automatically by impulse counters 
visible the examiner's side of the 
A separate score for each revolution was 
in addition to a total score for each tas 


on apparatus 


tained 


RESULTS 


Table 1 summarizes the data on age, in- 
telligence (Shipley-Hartford to 
W-B IQ equivalents) (Sines, and 
CPT scores. Analyses of variance were used 
to test the significance of the differences. The 
groups did not differ as to intelligence, but 
did differ significantly (p < .05 > .01) as to 
age. The product-moment correlation between 
age and CPT was —.10, making a 
covariance adjustment for this variable un- 
necessary. 

While a number of alternative means for 
scoring the test were tried, this report will 
confine itself principally to the 


converted 
1958), 


score 


“absolute 


score” (X-A, AX-A) found by Rosvold et al. 
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to be the most discriminating. This is ob- 
tained by dividing the number of correct 
responses by the number possible. Table 2 
presents the percentage of patients in each 
diagnostic category of this study compared 
with the adult normal (AN) and brain dam- 
aged (ABD) groups of the original study. In 
general, all of the patient groups in this 
study were superior in performance to the 
adults tested by Rosvold et al. On the X task, 
even the brain damaged patients tended to 
obtain higher scores than the normals that 
Rosvold studied. 

None of the to X-A 
scores. The remaining therefore, 
concern only the AX-A scores unless other- 
wise noted. The AX-A scores significantly 
differentiated the brain damaged 
patients and < .01 The BC 
patients were significantly different (p < .01 
from the PN and HN patients. The PS grou; 
differed significantly (p < .01) from the PN 
and HN patients 

Patients were classified further as to the 
medication they were taking at the time of 
testing. Those on depress 
pared to those without 
The 


differed 
analyses, 


groups as 


between 
others (p 


int drugs were com- 
medication by an 


analysis of variance differences were not 
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Cross-VALIDATION OF A BRAIN DAMAGE TEST 


rABLE 2 


PERCENTAGI Eacu DIAGNosTIC ( 


ROSVOLD, MIRSKY, 


ATEGORY OF 
SARASON, BRANSOME, AND Beck (1956) 


THIS StuDY COMPARED TO ADULT GR 


Diagnostic categories 


When the 
gory was further refined by comparing only 
taking chlorpromazine with those 
taking meprobamate or no drug within the 
PN category, the differences not 
significant. 

Forty-five patients 
available. These were divided into 
(N = 15), diffuse (NV = 14), focal (NV 
and borderline (NV = 3). The CPT 
of the EEG groups differed significantly 
(p < .05). However, only the diffuse group 
differed significantly from the others. Ad- 
ditional analyses did not support the alter- 
nate possibilities that the differences between 
the diffuse EEG group and other patients 
were due to differences in 
or drugs. 

The suggestion had made that 
patients with paroxysmal EEGs would per- 
form less well on the CPT than those with 
nonparoxysmal records (Mirsky, Primac, 
Stevens, & Ajmone-Marsan, 1958). Accord- 
ingly, the diffuse EEG group was divided into 
those having paroxysmal records (N = 4) 
and those without paroxysms (N = 10). 
A Mann-Whitney U test was not significant 
(U = 19, p>.0S). 

The reliability of the AX-A scores for all 
cases (VN = 127) was determined by corre- 
lating the odd 
drum with the even 


significant depressant drug cate- 


those 


were also 


had EEG tracings 
normal 
= £33. 


scores 


intelligence, age 


been 


revolutions of the 
\ product-moment cor- 
(uncorrected) of .96 


scores on 


relation was obtained. 


Following the Meehl 
Rosen (1955), a cutting line between AX-A 
scores of 90 and 91 appeared to give the 
optimal brain damaged 
patients in this sample. Of 53 brain damaged 
patients, 17 obtained scores of less than 91 
while 10 of the 71 non-brain-damaged 
patients obtained scores less than 91. 


and 


suggestions of 


differentiation of 


Since 6 
diagnosed as 
psychotic, the efficiency of the test 


of the 10 false positives are 
can be 
increased by withholding diagnoses from this 
test when a patient obtaining such a 
also appears psychotic. However, these in- 
terpretations of the scores should in turn be 
cross-validated on other samples. 


score 


DISCUSSION 

The analysis of the data on the CP 
tends to confirm in general the originators’ 
claim that it would differentiate brain dam 
aged patients from other groups. In terms of 
its clinical usefulness, however, the CPT was 
far from successful, being about equal in 
differentiating the 
brain damage (approximately 33% in 
hospital studied). 

The fact that the false positives, for the 
most part, tended to occur in the psychotic 
group leads to the suggestion that this test 
might be most useful in measuring the degree 
of attention 


rates for 
the 


power to base 


a patient can focus on a task 
Disturbance in attentivity is an 
both 


important 


indicator of psychosis, organic and 
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functional. For this purpose, the CPT should 
be useful to the clinician. 

The fact that only one-third of the brain 
damaged patients obtained pathological scores 
should continue to direct the clinician to the 
obvious inadequacy of any single instrument 
for detecting brain damage. Like the term 
“ill,” the term “brain damage” is too broad 
to be diagnostically useful. It is unlikely that 
damage from any cause, of any extent, in 
any area of the brain will have univocal con- 
sequences. Nor can the factors of duration of 
illness and age at which damage occurred bi 
ignored. 

Though EEG evidence of brain damage 
was correlated with the CPT the 
finding by Mirsky et al. (1958) that patients 
who have paroxysmal EEGs tend to 
lower than other patients was not confirmed. 
Two types of evidence indicate that failure 
to do well on the CPT is due to a persistent 
rather than an intermittent condition. First, 
extent of damage to be directly 
related to the CPT in that 
having diffuse EEGs differ from with 
focal EEGs, but with focal EEGs do 
not differ from those having normal trac 
Furthermore, the high 
odd-even CPT scores suggests that whatever 


Scores 


score 


appears 

score patients 
those 
those 
ings 
correlation between 
factor effects failure to respond does so con- 
sistently. An analysis of X scores yielded a 
similar result. Scores for the first five revo- 
lutions were compared to those for the last 
five. The difference was less than 
(failure to respond or pressing in 


one error 
corres tly 3 
and was not significantly different from zero. 

Thus, while the CPT be 
supply a n of 


able to 

brain 
damage, it does appear to be measuring at- 
tentivity and, conceivably, the related concept 
of memory. The inability of the simpler 
portion of the test (X) to discriminate be- 
tween groups suggests that continued devel- 
opment of this and similar tests should be in 
the direction of increased complexity. The 
fact that intelligence correlated only .26 
with AX-A scores indicates that the test is 
far from straining the intellectual capacity 
of most patients. The markedly skewed dis 


may not 


pathognomonic | si 
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of scores also 


tribution suggests that the 
test is much easy to provide adequate 
discrimination in its present form. 


too 


SUMMARY 


A version of the Continuous Performance 
Test (CPT) was administered to consecutive 
admissions (NV = 127) to a Veterans Ad- 
ministration hospital. Patients having cortical 
brain damage significantly less 
well (p < .01) than neurotic or neuropsychi- 


performed 


aiso 


performance 


atric normals. Psychotic patients were 
significantly poorer in their 

than neurotics patients. 
Patients with damage did 
not differ significantly from the other groups 
Medication within the 
range had no effect 


and hospitalized 


subcortical brain 
dosage 
Intelli- 


scores 
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on CPT scores 
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THE RELATIONSHIP OF COGNITIVE VARIABLES 
TO THERAPEUTIC ABILITY AND TRAINING 
OF CLIENT CENTERED THERAPISTS ' 


LEONARD E 


GOTTESMAN 


University of Chicago 


The ability of the psychotherapist to un- 
derstand the communications of his patient 
has been included in the theoretical formula- 
tions of almost every school of psychotherapy 
as an essential psychotherapeutic skill (e.g., 
Kelly, 1955a, 1955b; Rogers, 1957). Under- 
standing has also figured prominently in de- 
scriptions of the “ideal therapeutic relation- 
ship” as viewed by experts of the Freudian, 
Adlerian, and Rogerian orientations (Fiedler, 
1950a, 1950b). Furthermore, it has appeared 
as an important element of psychotherapy in 
several recent experimental studies (Barret- 
Lennard, 1958: Fiedler, 1950b; 
1951; Halkides, 1951). 

Among contemporary psychotherapy theo- 
rists,, both Carl Rogers and Kelly 
clearly imply their allegiance to the cognitive 
theoretical tradition (cf. Scheerer, 1951) by 
their statements that every person’s behavior 
is determined by the way in which he per- 
ceives and organizes his world. In view of this 


Grummon 


George 


position, the relationship of a therapist’s cog- 
nitive processes to his therapeutic ability and 
his therapeutic training assumes great impor- 
tance. This study is designed to test the cog- 
nitive functioning of trainees in client cen- 
tered therapy and to relate it to their ability. 
In addition, by comparing changes in the 
cognitive functioning of therapists in training 
with those of students being trained in The- 
matic Apperception Test analysis, the unique 
effects of therapy training on cognitive proc- 
esses may be isolated. Kelly (1955a) has sug- 
gested a model for describing these cognitive 
processes. He proposes that a person’s experi- 


The study on which this paper is based was sup- 
ported in part by funds from the Ford Foundation 
Psychotherapy Research Grant and was submitted 
as a dissertation in partial fulfillment of the require- 
ments for a Doctor of Philosophy degree at the Uni- 
versity of Chicago. 

2 Now at the University of Michigan. 


ence is organized into a number of bipolar 
personal constructs, each of which is defined 
by an object to which it is similar and an- 
other from which it is different. He has sug- 
gested that for the therapist two aspects of 
his system of bipolar constructs are especially 
important. The effective psychotherapist must 
have a large number of constructs available 
so that he can conceptualize a wide range of 
client behavior, and he must use his con- 
structs in a free and open manner so as not 
to limit the range of client behavior which 
may be included within a given construct 
Kelly calls these attributes “complexity” and 
“permeability,” respectively. 

Although the relationship of permeabilit) 
to therapist ability has not been investigated 
cognitive complexity has been the subject of 
some pertinent recent studies. Bieri 
has demonstrated (in undergraduates) a 
strong positive relationship between cognitive 
complexity (as measured by the 


(1955) 


degree ol 


differentiation of concepts ac hieved by sub 


jects in distinguishing significant other peo 
ple in their world) and accuracy of interper 
sonal prediction. By demonstrating a rela 


tionship between complexity measured in this 
manner and also as inferred from the percep- 
tion of ink blots, Bieri and Blacker (1956) 
have suggested that complexity is a pervasive 
personality Barron’s (1953) 
profile of the cognitively complex person con- 
forms closely to the ideal therapist as he 
would be described by Rogers (1951), by 
Kelly (1955a), and by Fiedler’s data (1950a). 
In Barron’s study the complex person meas- 
ured by the Barron-Welsh Art Scale is low in 
rigidity, friction, control of impulse by re- 
pression, authoritarianism, ethnocentrism, and 
social conformity, while being high in impul- 
sivity and lack of repression, independence of 
judgment, 
originality, 


characteristic. 


verbal fluency, expansiveness, and 
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It seems from Barron’s findings above that 
cognitive complexity correlates highly with 
those personal characteristics which both 
Rogers and Kelly value highly both in the 
generalized ideal person and in psychothera- 
pists. The question of this presumed relation- 
ship defines one of the main hypotheses of 
the research reported here. 

If complexity is related to therapeutic abil- 
ity, then it would be appropriate to discover 
if training in psychotherapy alters a trainee’s 
placement on the trait dimension of sim- 
plicity-complexity. Bieri and Blacker have 
suggested that complexity is a relatively deep- 
rooted characteristic of the person and that 
it is not likely to change readily. The second 
major hypothesis of this research is concerned 
with just this question. 

In addition to these main concerns, an 
auxiliary point of interest should be raised. 
Both complexity and permeability are formal 
characteristics of constructs. There are also 
characteristics of the application and content 
of a therapist’s constructs that may 
his ability and be affected by his 
First, training may have a strong 


relate to 
training. 
effect in 


determining which specific aspects of a client’s 


message the therapist chooses to conceptual- 
ize. He may have many or few constructs. He 
may use these loosely or rigidly, but he still 
must decide (on some level) what he feels is 
important. How much his decision is influ- 
enced or bound by the orientation of his 
teachers may be related to his ability as 

therapist. To approach this problem, thera- 
pists were also rated on the “popularity” 

their constructs (i.e., their tendency to make 
frequent use of constructs which were used 
often by other students in their class). These 
ratings were compared to their rated thera- 
peutic ability. In spite of its possible impor- 
tance, it was difficult to predict the direction 
of relationship of this variable to ability to 
do client centered therapy. While Rogers’ po- 
sition is strongly negative on the usefulness 
of diagnostic categories, it would also seem 
reasonable that a student’s exposure to a 
course of training in a given therapeutic ori- 
entation would, if he were a good student, 
teach him a specific jargon suitable to that 
orientation. For purposes of this study the 
hypothesis was stated in the positive direc- 
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tion (i.e., that the therapist’s ability would 
be positively correlated with his use of popu- 
lar constructs). As an additional measure of 
the effect of training, the “homogeneity,” 
or group agreement on conceptualization of 
stimuli, was also considered. 

To summarize, the specific hypotheses of 
this study were (a) that the degree to which 
a therapist’s constructs are permeable, com- 
plex, and popular bear a positive relationship 
to his rated therapeutic ability, and (b) that 
the result of training in client centered psy- 
chotherapy is to increase the permeability, 
complexity, popularity, and homogeneity of 
the constructs of the therapist group. 


METHOD 
Subjects 


The experimental group contained 13 students in 
a three quarter long graduate practicum in client 
centered counseling and psychotherapy at the Uni- 
versity of Chicago. The control group consisted of 
19 graduate clinical psychology students taking a 
course in TAT analysis. None of these students had 
ever taken the Counseling Center sequence or done 
any kind of psychotherapy. Since both the experi- 
mental and the control groups were composed of 
clinical psychology students at about the same level 
of training, comparing them offered a reasonable test 
of the hypothesized changes in therapist’s constructs 
as a result of the unique training in client centered 
therapy 


Instruments and Procedures 


The experimental instrument was a modification of 

e Role Construct Repertory Test (Kelly, 1955a, 

55h). In Kelly’s original version, the subjects were 
asked to generate constructs employing as stimuli 
significant other people in their world. In order to 
approximate the therapy situation more closely, the 
present study used statements made by clients in 
therapy as stimulus items. In exploratory work, four 
test forms containing 10 client statements each were 
devised, using quotations selected at random from 
several tapes in the files of the Counseling Center of 
the University of Chicago. Each client statement ap- 
peared in three different triads. Within each form 

3 Appreciation is expressed to John M. Butler and 
the staff and students of the Counseling Center for 
their cooperation with this research. 

The ages of the students were not considered. Also, 
since several studies (Hunt, 1951; Bieri, 1955; Bieri 
& Blacker, 1956) have shown no relationship between 
IQ and cognitive complexity measures similar to the 
one used in this study, and since the range of IQ in 
the subjects was highly restricted, IQ was not con- 
trolled. 
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the 10 statements were arranged randomly into 10 
groups of 3 statements each 

The present study, using two of the four forms 
and modified instructions, was based on the explora- 
tory work. Both the experimental and control groups 
were given both forms in group administrations at 
the beginning (before the therapy students had seen 
any clients, but after one quarter of didactic train 
ing) and at the end (6 months later) of the therapy 
students’ supervised therapy experience. 

Each subject had a test booklet containing the 10 
triads of client statements and an answer sheet con- 
sisting of a 10 X 10 matrix and space for recording 
the similarity and difference poles of 10 constructs 
With his test booklet before him the subject was 
presented with a triad of client statements in tape 
recorded and written form. He was asked to “decide 
on a single dimension which fits all three statements, 
along which any two of the three are alike, in any 
way that is meaningful to you as a therapist, but, at 
the same time, different from the third.” 

After he had decided on a similarity and differ- 
ence, he was asked to circle the numbers next to the 
two similar statements and then to write on his an 
swer sheet the way in which they were similar and 
the way in which the third statement differed from 
the other two. (This instruction varied from those 
used by Kelly in allowing the difference pole to be 
something other than the opposite of the similarity 
pole, but kept the notion that the two poles must 
be on a single dimension.) In all there were 10 triads 
presented and, therefore, 10 bipolar constructs gen- 
erated by each subject on each form of the test 

Following this, each subject was asked to rate all 
the written client statements on each of hi 
structs, in terms of whether the statement was at 
the similarity (S) pole of the construct, the differ- 
ence (D) pole, or could not be rated on the con- 
struct (X). The dependent variables were measured 
as follows: 


con- 


Permeability. Permeability (or openness of con- 
structs) was defined as the number of times the sub- 
ject applied his constructs to client statements. This 
was measured by a simple sum of S and D mark- 
ings in the 10 X 10 matrix of S, D, and X 
generated by each subject 

Complexity 


scores 


A subject’s complexity was measured 
by the number of different bipolar constructs he 
used. Duplications of constructs were not counted 
A construct was counted as a repeat if either the 
similarity or duplicated or nearly 


difference pole 


duplicated the exact wording of either pole of an- 
other construct (Bieri & Blacker, 1956). This method 
gave a theoretical maximum complexity score of 10 
and a minimum score of 1 for each form 


The complexity scoring was done by the investi- 
gator, without knowledge of the subjects’ ratings on 
therapeutic ability, but the reliability of the scorings 
was obtained by selecting 10 protocols at random 
for independent scoring by another judge. The in- 
terscorer reliability coefficient (Rho) for these 10 


protocols was 8&5 (p< 0s). 
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Popular constructs. All construct poles (ie., each 
half of the bipolar constructs) of the 13 experi- 
mental subjects and the 19 control subjects on both 
pre- and posttestings were categorized according to 
their content. This coding yielded 210 separate con 
struct categories. Four categories, however, accounted 
for over one-fourth of the total responses of the 
combined groups, and were separated from the next 
most popular category by 50 responses. These four 
categories (self, other, describing, and experiencing) 
were designated as popular, and each experimental 
and control subject was given a score for the num 
ber of popular constructs he used at each adminis- 
tration. 

Homogeneity. Homogeneity, a group character- 
istic, was measured in two ways. The first was sim- 
ply the mean percentage of subjects agreeing that 
two members of a triad went together, and the sec- 
ond was the percentage of the group that agreed on 
the pairing within a triad, that used the same con- 
cept for conceptualizing the triad. To arrive at this 
second percentage, Bieri and Blacker’s (1956) method 
for counting constructs was used. 

Criterion measures. Two measures of students’ abil- 
ity as client centered therapists were used as criteria 
for this study. For the first measure practicum in 
structors were asked to rate, along eight-point scales, 
each student’s overall ability as a therapist, his abil 
ity to understand clients, his ability to communicate 
his understanding to a client, and his ability to form 
a therapeutic relationship. In addition, three experi 
enced client centered therapists who did not know 
the students completed the same ratings after listen 
ing to a 10-minute segment of a tape recorded ther 
apy hour that each student had submitted as 
ample of his work 

Using Ebel’s (1951) intraclass correlation for com- 
puting the reliability of instructors’ ratings, inter 
rater reliability coefficients ranged from .85 to .89 
and were significant at less than the .001 level of 
probability. However, the scales themselves each cor- 
related with the others at .88 or above (p< .001) 
Ratings for each subject were therefore pooled across 
raters and scales and the subjects rank-ordered ac- 
cording to the pooled ratings 
called the “class ability” 


an ex- 


This composite was 
ranking 

The therapist judges agreed among themselves on 
their ratings of the subjects’ therapy tapes (r = .71, 
p< 001), and all intercorrelated at r= .64 
(p< 02). These ratings, too, were therefore aver- 
aged across raters and scales and then rank-ordered 
to give the “tape ability” ranking, which was used 
as a second criterion measure. This remarkably high 
agreement found in the instructors’ ratings could be 
due to a high level of communication about students 
among practicum instructors. The ratings, however, 
correlate highly with independent judgments made of 
tape recorded segments of students’ therapeutic work 
and, therefore, seem to have some validity. The basis 
for the high agreement will be the object of future 
research. 

Over all 13 subjects the class ability ranking and 
the tape ability ranking correlated at Rho= .68 (p 
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<.05). To increase the reliability of the criterion 
measure the two bases of rating were pooled into a 
“combined ability” rank. However, since the two 
rankings were far from perfectly correlated, the tape 
and class ability rankings were also considered sepa- 
rately. 

In both the experimental and control groups all 
subjects took two test forms at both administrations 
The scores on these forms were combined and all 
analyses made on the combined form, using non- 
parametric statistics. 

In summary, the hypotheses of this study con- 
cerned both the effect of training on two formal 
characteristics (complexity and permeability) and 
two content characteristics (popularity and homo- 
geneity) of the constructs of trainees in client cen- 
tered therapy and the relationship of the level of 
these formal and content characteristics after train- 
ing to the judged ability of the trainees. Implicit in 
the hypotheses was the expectation that the training 
would increase the level of these qualities in the 
trainees. 

RESULTS AND DISCUSSION 


Using Wilcoxon tests, comparison of the 
mean scores of the experimental and control 
groups on complexity and permeability failed 
to show any significant change over time for 
either group on either variable. Mann-Whit- 
ney comparisons of the two groups further 
showed no significant difference in amount or 
direction of change on either variable. Finally, 
Mann-Whitney tests showed no difference be- 
tween the groups at either pre- or posttrain- 
ing testing on either variable. 

A close similarity between the scores of 
the therapist and control groups at the first 
testing (Complexity X = 15.54 and 14.11; 
Permeability X = 39.6% and 49.5%) sug- 
gests that the subjects were selected from 
similar populations. It is, therefore, especially 
significant that, whatever the effects of train- 
ing, the findings clearly reject the hypothesis 
that it increases the permeability and com- 
plexity of trainees’ constructs. 

While the significance of Rho’s derived 
from small Ns is difficult to assess, the fact 
that the correlations reported in Table 1 are 
consistent with the predicted posttraining re- 
lationship between permeability and ability 
rankings is surprising in view of the findings 
reported in the paragraph above. Although 
there was no relationship between the cri- 
terion rankings and permeability before train- 
ing, after training, as predicted, there was a 
significant (Rho = .49, p < .05 one-tail) re- 
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TABLE 1 


RELATIONSHIP (RHO) OF COGNITIVE VARIABLF 
TO ABILITY CRITERION MEASURES 


Combined 
ratings 
re Post 
02 
O05 
09 


lationship between the combined class and 
tape ability rankings and permeability. The 
correlations of permeability to the rankings 
considered separately are in the predicted di- 
rection. 


The basis for these positive findings seems 
to be in the tendency of individual trainees 
to change in construct permeability, while 
the group mean remained constant. Absolute 
change in permeability correlated negatively 
with class instructors’ ability rankings, but in- 
crease in permeability was positively related 
to both class and tape ability judgments (Rho 
= .41, not 


significant). These findings sug- 


gest that there are some relationships among 


permeability, therapeutic ability, and the ef- 
fects of training, although they are appar- 
ently more complex than the direct monotonic 
relationship hypothesized in this research. It 
must also be emphasized, however, that the 
relationships found were weak ones. 

The relationships of construct complexity 
to ability (Table 1) are similar to those for 


nificant positive relationship (Rho=. 

< .05 one-tail) after training between the 
number of constructs a therapist used and his 
ability as rated by class instructors. The cor- 
relations with the other two criterion rankings 
are in the predicted direction, but small. None 
of the correlations before training differ sig- 
nificantly from zero. 

As with permeability, training did not in- 
crease the complexity of the therapist group. 
On this variable, however, there was also no 
significant change in the relative rankings 
of trainees over time (Rho = .81, p< .01). 
Nevertheless, with only 13 experimental sub- 
jects, the positive findings could be explained 
by an increase in complexity of only one or 
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two subjects with high ability rankings. The 
tendency toward a relationship between com- 
plexity and class instructors’ ability rankings 
(Rho = .23, not significant one-tail) which 
existed before training may thus have been 
increased (Rho = 49, p< .05 one-tail) by 
the training experience. Such a change would 
again support the possibility that training had 
an effect, but here, too, it must be emphasized 
that the correlations reported are quite low. 

The fact that there was a significant corre- 
lation between cognitive complexity and the 
ability rankings by class instructors, while the 
correlation between complexity and tape rank- 
ing was (Rho .14, not significant), 
raises the possibility that cognitive complexity 
may be related to factors in the instructors’ 
ratings other than the students’ actual thera- 
peutic skill. Barron (1953) cites verbal flu- 


le WwW 


ency as a characteristic of cognitively com- 
plex people; it is possible that therapeutic 
ability and verbal fluency may be somewhat 
confounded in instructors’ ratings. 
Mann-Whitney U tests 


popularity 


(comparing the sub- 


jects’ and homogeneity scores ) 


more 
before 


show the experimental group to use 


populars than did the both 
(p < .O1) after (/ 1) practicum 
training. (Experimental: pre A 11.15, post 
x 14. control: pre X post xX 
suggests 


6.31). The pretraining difference 


and 
fy f 5. 


that the one academic quarter of didactic in- 
struction which preceded the first testing had 
taught the an agreed 
set of symbols. A further increase in 
of populars, attributable to 


already trainees upon 
the use 
training, was also 
supported, however. The trainees used signifi- 
cantly more populars after training than be- 
fore (p < .05), while the control group did 
not change. 

This increased use of a limited number of 
constructs was not, however, correlated with 
high ability rankings by either class instruc- 
tors or tape judges. It remembered 
that, in the face of a conflict between Rogers’ 
negative position usefulness to 
therapists of popular diagnostic concepts, and 
the hypothesized effect of training in a spe- 
cific therapeutic orientation, a prediction was 
made that there would be a positive relation- 
ship between the use of popular, or stereo- 
typed, constructs and therapists’ ability rank- 


will be 


about the 
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ings. Table 1 shows that there is no support 
for this hypothesis. 
from 


The correlations ranged 
13 to .19 and were all within chance 
expectancy. Saying things fluently may have 
contributed to the correlation 

nitive complexity 


between cog- 


and ability, but, clearly, 
said was not in 


high 


saying what everybody else 
itself likely to 
ability ranking. 

This finding is supported by the effect of 
training on the homogeneity of the therapist 
group. Both before and after training the sub 
jects in the therapist group agreed more than 
did the control subjects on which statements 
should be 
conceptualized. Training changed the experi- 
mental subjects and did not affect the con- 
trols, but the effect of 
the therapists’ agreement 
stimulus pairing () - 5) 


help a trainee achieve 


were to be paired and how they 


training was to de- 


crease both on 
and on the con- 
struing of the agreed upon pairs (p 05) 


(The mean 


was 6] © pre 


agreement on pairing of stimuli 


and 56.6 post for experi- 


mental subjects and 5 “co pre 


and 53.6 
t for control subjects) 
Both 


tainty 


pos 
trainees and controls showed 


the il 


uncel 


about shifting in 


const? Ss DI 


rank ire posttesting 
Since the data show that the controls did not 


hile the therapist 


popularity 
increase in popularity, w 


group did, it 


1 


must be concluded that the in 
creased popularity of the trainees’ 


effect of the 


constructs 
The data 


Roge rs’ posi- 


is an therapy training 


are, however. consistent with 


tion in showing that neither high usage of 


popular constructs nor increase in populars 


contributed 


to high rankings on therapeutic 
ability. In light of this ; 
negative 


the therapist group must be considered an un- 


support for Rogers 


stand, the increase in populars in 


intended one so far as the course instructors 


were concerned. Perhaps despite their at 
tempt to discourage trainees from using diag 
nostic constructs, the instructors were getting 
constructs (self experi 
lecerihi co] Lente leas 

describing) which the students learned 


and used 


across a set of other, 
encing, 
From the findings on homogeneity 
it is evident that despite the increased uni- 
formity about which constructs to use, train- 
ing caused the therapists to decrease in agree- 
ment on how and when to use them. This de- 
crease may reflect the intent of the instructors 
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to train the students away from stereotyped 
responses by de-emphasizing all formal con- 
ceptual behavior. But then the implicit teach- 
ing of stereotyped responses, reflected by the 
increase in populars, was certainly a conflict- 
ing message. One wonders whether more ef- 
fort devoted to direct tuition in the applica- 
tion of constructs to a wide variety of client 
behavior would not have increased both the 
therapists’ ability to use their constructs and, 
consequently, their ability as client centered 
therapists. While the data suggest that this 
emphasis on teaching applications of con- 
structs might limit the cognitive complexity 
of the therapists (complexity and populars 
correlate at Rho = — .70), the findings re- 
ported suggest but a weak and questionable 
relationship between complexity and _ thera- 
peutic ability. By contrast, training in the 
application of constructs to client behavior 
would be likely to increase the openness (per- 
meability) of therapists’ constructs. Since 
both permeability and an increase in per- 
meability correlated significantly with thera- 
peutic ability, this would seem a highly de- 
sirable effect. 
SUMMARY 


Using a modification of the Role Construct 
Repertory Test, applied to the categorization 
of statements drawn from clients in therapy, 
this study measured two aspects of the form 
and two aspects of the content of the con- 
structs of a group of trainees in client cen- 
tered therapy and a control group of trainees 
in diagnostic testing. 

Of the formal characteristics, (a) perme- 
ability (openness of constructs) was corre- 
lated with rankings of therapeutic ability 
as rated independently by supervisors and 
trained therapists. Although there was no in- 
crease in the experimental group’s mean per- 
meability scores as a result of training, there 
was a shifting of ranks within the group such 
that for individuals an increase in perme- 
ability was associated with therapeutic skill. 
(5) Complexity (number of constructs) was 
related only to a judgment of therapeutic abil- 
ity, which may have been heavily influenced 
by a verbal fluency factor, and showed neither 
an increase attributable to training nor a 
shifting of ranks within the experimental 
group. 
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Of the content characteristics, (a) popu- 
larity (tendency to use a common body of 
constructs) was greater in the experimental 
group than in the control group, even before 
training, but it also showed an increase at- 
tributable to training. Neither increase in nor 
level of use of populars was correlated with 
therapeutic ability. (6) Homogeneity, a meas- 
ure of group agreement on the conceptualiza- 
tion of stimuli, showed a significant decrease 
as a result of training. (c) Popularity of con- 
structs was negatively related to complexity 
(Rho = — .70) but unrelated to permeability. 

It is suggested that, as a result of conflict- 
ing messages communicated by the instruc- 
tors, the trainees learned to use an implicit 
group of Rogerian constructs but became in- 
creasingly unsure of how to apply them. Since 
the data do not support a relationship be- 
tween complexity and therapeutic ability, it 
is proposed that direct tuition in use of con- 
structs, rather than implicit communication 
of an ambiguous body of constructs, might in- 
crease trainees’ confidence in using constructs 
and also increase their permeability and thera- 
peutic ability. 
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SEX DIFFERENCES IN THE EFFECT OF ITEM 
STYLE ON SOCIAL DESIRABILITY AND 
FREQUENCY OF ENDORSEMENT 


MARVIN R. GOLDFRIED ! 
University of Buffalo 


The influence of stylistic components of 
item form on responses to personality inven- 
tories has been emphasized in recent years. 
Most often, such variables have been viewed 
as sources of error to be avoided or mini- 
mized. Buss (1959) has noted that one ap- 
proach to controlling the influence of vari- 
ables other than item content would be to 
alter the manner in which items are written. 
He investigated several item styles (e.g., “I 
sometimes . »’ “I can’t help »” etc.) 
and found that social desirability ratings and 
frequency of endorsement were significantly 
elevated and depressed by certain styles. 

Recent studies by Cowen (1958) and Cowen 
and Stiller (1959) have indicated that sex 
differences play a significant role in the rat- 
ing of social desirability. The purpose of the 
present study, then, is to investigate the dif- 
ferential effect of item style on social desir- 
ability ratings and frequency of endorsement 
for males and females. 


METHOD 


The seven item styles and 10 item contents used in 
the present study were the same as those employed 
by Buss (1959). Ten aggressive statements were 
modified in style by the addition of the following: 
“Trouble controlling,” “Can’t help,” “Feel guilty 
about,” “Tempted to,” “Like most people,” and 
“Must admit”; the seventh style was the unelabo- 
rated statement. The 10 item contents were as fol- 
lows: 

I sometimes get into fights 
2. When I get mad, I say nasty things 
I tend to get into arguments. 

I tend to use strong language. 

When I get angry, I sometimes pick up the 
nearest thing and break it 

I tend to be rude to people I don’t like. 


t 


ae o 
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7. I sometimes have outbursts of temper. 

8. When I don’t like a rule, I tend to break it. 
9. I tend to make fun of others. 

10. I dislike many people I know. 


A 70-item list resulted from writing each of the 
10 items in each of the seven styles. These items were 
rated on a nine-point social desirability scale (maxi- 
mum social desirability at nine and maximum social 
undesirability at one) by 50 male and 50 female stu- 
dents enrolled in an introductory psychology course 
The 70 items were then divided into seven 10-item 
inventories with no content repeated in a single set; 
each inventory contained the seven styles, with three 
styles repeated. Frequency of endorsement was de- 
termined by administering these inventories to 210 
male and 210 female college students who responded 
“true” or “false” to each 
were different from thos« 
sirability. The N for 
was 60 (30 males and 


Statement 
who 


These subjects 
for social de- 
the seven inventories 
females). 


rated 


each ol 


RESULTS AND DISCUSSION 


Table 1 presents the results of an analysis 
of variance of the effect of item style and sex 
on the social desirability ratings. The analy- 
sis used was that for the Lindquist Type I 
“mixed” design (Lindquist, 1953). 

The analysis demonstrated no significant 
interaction between style and sex, indicating 


rABLE 1 


ANALYSIS OF VARIANCE OF EFFECT 
AND SEX ON SOCIAL DESIRAB 


F Irem Sty. 
{LITY RATINGS 


Source of variance WS 


Between-Subject 
Sex 
Error (b 
Within-Subjects 
Style 
Sex X Style 
Error (w) 


76.03*** 


0.64 


Total 


* Significant at the .001 
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STYLE, AND SOCIAL DESIRABILITY 


TABLE 2 


MEAN SocrAL DESIRABILITY 


Buss (1959 
Goldfried 


stvle did not change social desirability 


ratings differently for the two sexes. Accord- 


ingly, the simple effects were not tested. The 


analysis revealed that style significantly af- 
fected social desirability ratings. Table 2 pre 
sents the mean social desirability values for 
the different item styles and the results com- 
paring the means for the 
with the mean for the unelaborated style 
While Buss used ¢ tests, Duncan’s (1955) 
multiple range test was used in the present 
investigation inasmuch as the six compari- 
mutually orthogonal and it 
might be questioned whether a suffi- 
cient a priori basis for limiting the possible 
comparisons to these six. 


six elaborated styles 


sons are not 


there is 


Although four styles changed ratings in the 


direction of greater social desirability in the 


Buss study, only one style had this effect in 
the present investigation. The fact that the 
Guilty style consistently raised social desir- 
ability suggests that admission of guilt to a 
hostile act in part vitiates the social undesir- 
ability of the act itself. The reader will note 
that the 


mean social desirability values for 


rABLI 


VALUES FOR 


DIFFERENT Item STYL!I 


Item style 


he different styles 
differ only 


+ 


obtained in this study 
from those of Buss. In 
deed, when the means for the elaborated style 
ind the unelaborated style were compared 
using ¢ test, 


slightly 


two additional styles (Trouble 
controlling and Tempted) appeared signifi 
cantly to increase social desirability. As indi 
cated, however, it was not felt that f 
are appropriate in this situation 

Chi square was used to test the significance 


tests 


of the differences between the mean frequen- 
cies of endorsement for the six elaborated 
styles and the unelaborated style. Because 
each subject responded to all seven styles 
the scores for the elaborated and unelabo- 
rated styles are not independent and there- 
fore could not be compared simply. For each 
10-item inventory, the three repeated styles 
were discarded. The 210 males and 210 fe- 
males were separately and randomly divided 
into two subgroups (I and II) for each of 
the seven styles (A through G, the unelabo- 
rated). The comparability of Subgroups I and 
II was demonstrated by calculating 14 two- 
cell chi none of which 


squares, was signifi- 


5 
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cant at the .05 level. The differences between 
the mean frequencies of endorsement for the 
elaborated and the unelaborated styles were 
then compared, separately for the two sexes, 
as follows: A; X Gy, By X Gy, Cr X Gu, Du 
x G,;, Eu X Gy, Fr X Gy. The results of this 
analysis are presented in Table 3, where the 
percentages are based on I and II combined. 

It will be noted that the present findings 
differ from those of Buss. While the Must 
admit, Can’t help, and Guilty styles signifi- 
cantly influenced frequency of enforcement 
in the Buss study, they had no such effect in 
the present study. The Like most people and 
Tempted styles, which increased frequency of 
endorsement in the Buss study, had this ef- 
fect only for females in the present study. It 
appears that females are more likely to en- 
dorse an item if it appeals to a consensus or 
emphasizes impulses but not necessarily overt 
action. The Trouble controlling style had no 
effect in the Buss study but significantly de- 
pressed frequency of endorsement for male 
subjects in the present investigation. While 
this would suggest that males are reluctant to 
admit lack of control over their impulses, it 
is difficult to account for the failure of the 
Can’t help style, which implies no active ef- 
fort to maintain control, to decrease fre- 
quency of endorsement. Apparently males are 
more likely to endorse a statement implying 
that no effort is made than one implying un- 
successful effort. 

It should be pointed out that changing the 
style in which an item is written may have 
the effect of altering the content of the item 
For example, the statement, “T must admit 
that I sometimes get into fights,” implies that 
the individual responding “true” actually acts 
on his aggressive impulses, while the state- 
ment, “I am tempted to get into fights,” im- 
plies that aggressive impulses are present but 
not necessarily acted upon. The question may 
be raised whether differences between two 
such statements, in terms of social desirability 
ratings or frequency of endorsement, should 
not be attributed to a difference in content 
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rather than a difference in style. It should be 
noted, too, that the present findings on the 
effects of item style hold only for aggressive 
statements. It is quite possible that different 
results would have arisen had the items dealt 
with other kinds of behavior. In this connec- 
tion, a comparison of the frequency of en- 
dorsement of the unelaborated statements by 
male and female subjects yielded a chi square 
significant at the .05 level. Thus, in keeping 
with expectations, males endorsed more ag- 
gressive statements than did females. 


SUMMARY 


This study investigated the differential ef- 
fect of item style on social desirability and 
frequency of endorsement for males and fe- 
males. Ten aggressive statements were writ- 
then in seven different styles. Social desir- 
ability ratings and frequencies of endorsement 
were studied in groups of college students. 

Both social desirability and frequency of 
endorsement were found to be influenced by 
item style. The influence differed for the two 
sexes in the case of frequency of endorsement 
but not in the case of social desirability rat- 
ings. Social desirability was increased only by 
the Guilty style. Frequency of endorsement 
was depressed by the Trouble controlling 
style for males and increased by the Like 
most people and Tempted styles for females. 
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DEPENDENCY OF PATIENTS, PSYCHOTHERAPISTS’ RESPONSES, 


AND ASPECTS OF 


L. WINDER, FARRUKH Z. AHMAD, 


Stanford 
Interactions of patients with psychothera- 
pists are studied as these relate to two facets 
of psychotherapy: (a) continuation or termi- 
nation of treatment, and (6) content 
quences in the interviews. Patients studied 
are parents who sought help at a child guid- 
ance clinic because of emotional or adjust- 
ment problems of a son or daughter. The 
psychotherapists were graduate students in 
clinical psychology. All patients received in 
dividual treatment. The data are from the 
two initial psychotherapy sessions. The mat- 
ter of basic interest is the apparent effects of 
reinforcement by the psychotherapists of ver- 


se 


bal expressions of dependency by patients 
Hypothesis I. lf during the initial phase of 
treatment expressions of dependency need or 
the part of the patient are 
tively reinforced ) 


ipproached (pos 


than avoided 


by the psychotherapist, the patient will 


more olten 


tend 
to remain in treatment: but, if 


} " 4 : f 
there 1 nire 


quent approach as compared with avoidan 


the patient will tend to avoid treatment 
This 7 


that human development generates « 


hypothesis evolves from the thesis 
lepend 
ency need. These patients, in bringing a prol 
lem child to i 


ina depe na- 
ent manner. They tend to remain if the treat- 
ment relationship provides gratification 
dependency need, but tend to avoid treat- 
ment which involves relatively 


qe 4: 
a Cilnic, are acting 


of 
much frustra 
tion. Further, if psychotherapy involves dis- 
tress and discomfort, e.g 


through arousal of 
personal feelings and thoughts which are anx- 
iety and guilt provoking, gratifications in the 
treatment would be necessary to permit the 
patient to remain. Some suggestive evidence 
that more patients of warmer psychothera 
pists stay in treatment is available (e.g., 
Hiler, 1958). In discussions of psychothera- 
peutic techniques and processes, the impor 
tance of the warmth of the psychotherapist 
has been emphasized repeatedly (e.g., Raush 
& Bordin, 1957; Wolberg, 1954) 
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Hypothesis 11. lf expressions of dependency 
are approached by the psychotherapist, the 
relative frequency of such expressions will be 
sustained or will increase; but, if expressions 
are avoided (nonreward or negative reinforce- 
ment), the frequency of occurrence will de- 
crease. This hypothesis evolves from rein- 
forcement theory. Essentially the same hy- 
pothesis has been confirmed in regard to 
the aggression behavior system by Bandura 
Lipsher, and Miller (1960) 


METHOD 


Sub Th 


and 7 


tape 1 led int iews of 16 


fathers who commenced psychotherapy at 


moth 


a child guidance clinic provided data for this study 
These patients were seen by 5 female and 1 
hotherapists 
in clinical psy 
I doctoral level 


clinical ps 
of the psychotherapists each saw one of thi 


patients and six psychotherapist aw two patient 
of the sample 


Phase of treatment sampled. The first two psycho 
therapy sessions were subjected in thi 
study. All had participated in intake inter 
and diagnostic sessions, often with 
clinician who became the psychotherapist. There was 
however, an administrative separation between the 
diagnostic phase and the beginning of the treatment 
phase in the sense that the verbal contr to treat 
made until the 
completed. The content analy- 
sis, then, is of the beginning phase of psychotherapy 
having the context mentioned 
Duration of Two groups patients 
were formed on the basis of the length of time they 
remained in treatment. The “terminator group” 
the six patients who 
treatment, but who left after 
sions and without concurrent 
chotherapist and supervisor 
attended a of 
continuing well beyond 
There are 17 
entered treatment 
included 
who had 
reality 


to analysis 
patients 
views the 


same 


act 
and be treated, respectively, 
diagnostic phase was 


was not 


treatment of 
con 
ists of commenced 
than 10 ses 
of the psy 
‘remainer group” 
most 


definitely 
no mor 
agreement 
The 
sessions 
number 
group 


minimum and were 
that 


this 


interviews 
All parents who 
a specified period of time 
for 

left 


earner 


ol 
in 
during 
in the study 
than St 


cases 


were except three cases 


fewer ssions who treatment 


reasons such as wage being 


transferred to a distant city by his company 
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Coding of patient-psychotherapist interactions. The 
system used is entirely analogous to that presented 
in detail by Bandura, Lipsher, and Miller (1960) 
Elements of the system are as follows: 

1. The scoring unit: A unit is an interaction se- 
quence consisting of the beginning patient statement, 
the intermediate psychotherapist response, and the 
immediately following patient response. 

2. Patient behavior 
were dependency 
sion of 


categories: Categories scored 
(which is defined as any expres- 
approval-seeking, information-seeking, de- 
mand for initiation of activity or discussion by the 
psychotherapist, help-seeking, company-seeking, con- 
cern about disapproval, or agreement with others) ; 
hostility; and other categories as listed in Table 1 
When dependency units occurred, the object of the 
patient’s behavior was also scored as either psycho- 
therapist or other 

3. Approach by the psychotherapist: These are the 
reactions of the psychotherapist which are 
to elicit from the patient 
the topic under discussion. The particular categories 
are listed in Table 1. Instigation of 
any particular topic, e.g., 


designed 
further verbalization of 
discussion of 
dependency or 
coded as an approach to that topic 

4. Avoidance by the psychotherapist: These are 
the reactions by 
signed to 


hostility, is 


the psychotherapist which are de 
inhibit, discourage, or divert the patient 
from further verbalization about the topic under dis- 
cussion. The categories are listed in Table 1 
Retractions by the psychotherapist were also noted, 
and there was a miscellaneous category for psycho 
therapist responses which could not be 
the above categories. It 


assigned to 
is important to note that 
such sounds as “mmm-hmm”’ often are not included 
in either the approach or avoidance 
psychotherapist behavior and often are treated as in- 
consequential in the system as employed in this study 
and in the Bandura, Miller (1960) 
study. The analysis is similar in a general way to 
the so-called verbal conditioning paradigm, but the 
range of verbal reinforcers taken account is 
wider than is generally true, and the definition of 
the unit of patient behavior is 


cate yori Ss of 
Lipsher, and 


into 


somewhat unique 
Facial expressions and other gestures were not avail- 
able for coding, as tape recordings were analyzed 

Agreement among raters. Three raters, A, B, and 
C,1 were involved in this aspect of the project 
Raters A and B, after a training program utilizing 
interviews from other cases, scored 10 psychotherapy 
sessions independently. Agreement was satisfactory 
except for two subcategories of avoidance, i.e., ignor- 
ing and topical transition (see Table 1). The dis 
agreement resulted primarily from the infrequency 
with which Rater B coded such events. The cases 
selected for coding by Rater B were chosen so that 
approximately every fourth interview coded by Rater 
A was coded by Rater B. Had Rater A tended to 
shift in coding, this would probably have been re 
flected in disagreement with Rater B 

1 The authors wish to express their appreciation to 
Donald Hartmann and M. Zoberi for their assistance 
in coding interview data. 
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Since there was the agreement between Raters 
A and B on ignoring and topical transition, a third 
rater, Rater C, was trained. Rater C was given five 
terminator and five remainer cases to code. This se- 
lection of cases was made in order to obtain data on 
rater agreement on a set of cases which would be 
critical in the evaluation of the first hypothesis. The 
particular selection of cases was made also so as to 
sample the total sequence of coding by Rater A 
Agreement Raters A and C was satisfac- 
tory. Where agreement between Raters A and B had 
been high, and the fre of the event was very 
low, certain coding categories were omitted for Rat 
C, and these are noted in Table 1 

Since only Rater A coded all cases, the hypotheses 
are evaluated on the basi f the 
Rater A 


between 


coding done by 


RESULTS 


Agreement between raters. A first approach 
in assessing agreement between raters is in 
terms of agreement on elements of each unit 
coded. Raters may agree with regard to all 
elements of cach unit, or may disagree with 
(The ele 
ments of a typical unit would be (a) the pa- 
tient’s initial statement 
pist’s reinforcement, (c) the patient’s subse 
quent and (d) the object toward 
whom dependency is directed.) There were 
563 units scored by either Rater A or B, and 


regard to one or more elements. 


(b) the psychothera- 


response 


561 units scored by both. There was perfect 
agreement on all elements of 414 units. Raters 
A and C scored 388 of 


583 total units in ex 
The proportion of all 
scored units on which there was perfect agree- 
ment was .74 for Raters A and B, and was .67 
for Raters A and C. Similar proportions where 


actly the same way 


there was disagreement on only one element 
were .23 and .24. Thus, the bulk of units fell 
into the perfect agreement or one element in 
disagreement categories 

The approach in assessing agree- 
ment between raters is summarized in Table 1 
Rater agreement was determined for each spe 
cific category of response which was scored it 


second 


the study. Categories which are critical in the 
evaluation of the hypotheses are: dependency 
approach, and avoidance, with hostility being 
of definite interest. The object of dependency 
is an important category, also. Level of agree 

tended to be satis 
factory for all aspects of the scoring system, 
but it 


ment on specific categorie 


is the above categories which are of 
greatest interest. Proportion of agreement on 
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rABLE 1 


FREQUENCIES OF CATEGORIES, AND AGREEMENT 


OF RATERS 


I nits scored 
for total sub % Agreement, 
jects by Raters 


L\&B 


Avoidance 
Disapproval 
Silence 
Mislabeli 
Ignoring 


lopic trar 


Retraction 


Miscellaneou 
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dependency elements is .91 (294 total units) 
for Raters A and B; and is .92 (234 total 
units) for Raters A and C. For approach, 
analogous proportions are .84 and .98; for 
avoidance, .44 and .81; for object, .89 and 
.68; and for hostility, .89 and .83. 
Overview. From Table 1, relative frequen- 
cies of occurrence of various categories can be 
assessed. Note that ignoring and topical tran- 
sition are treated as a combined category. 
Hypothesis I. For each patient, the num- 
ber of times the psychotherapist approached 
or avoided the patient’s expressions of de- 
pendency was determined, and the ratio of de- 
pendency approached to dependency avoided 
was calculated. The prediction was that these 
scores would be higher for remainers than for 
terminators. A score of 1.00 would indicate 
dependency approached is equal to depend- 
ency avoided. Using 1.00 as the cutting score, 
the distribution of terminators and remainers 
is shown in Table 2. The Mann-Whitney U 
test indicates that such a result on the basis 
of chance would be unlikely (p < .01). 
Separate analyses were made for psycho- 
therapist’s responses to dependency when the 
psychotherapist was the object and when the 
dependency was directed toward some other 
object. While the theoretical position does not 
require the prediction that it is the responses 
by the psychotherapist to expressions which 
have him as an object which will be most 
strongly related to the events of treatment. 
this emphasis is in keeping with the point of 
view guiding the study. The approach-avoid- 
ance ratios for dependency units with and 
without the psychotherapist as the object 
were calculated. With the psychotherapist as 
object, the results are the same as those in 


TABLE 2 
PSYCHOTHERAPISTS’ Appt 
TOWARD 


REACT! 
TERMINATORS AND 


MACH—AVOIDANCI 
DEPENDENCY FOR 
REMAINERS 


\pproac! Approacl 
\voidan 
than 1.00 or more than 1.00 


\voidance 


(,roup 


Terminator 


Remainet 


Note.—Mann-Whitney U test: p < .01 
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Table 2. If some other was the object, there 
were two terminator cases where approach ex- 
ceeded avoidance of dependency and three re 
mainer cases where avoidance exceeded ap- 
proach, a set of frequencies which could 
easily be ascribed to chance (p > .05 for the 
Mann-Whitney U test). 

Hypothesis II. For each patient, there was 
a determination of the proportion of depend- 
ency bids followed by approach which were 
in turn followed by a further expression of 
dependency. The mean proportion is .68, in 
dicating that the tendency is for approach to 
dependency to be followed by further expres- 
sion of dependency. Similarly, the mean pro 
portion of dependency bids followed by avoid- 
ance which were then followed by a furthe1 
expression of dependency is .11 

For each patient, the two proportions—de- 
pendency after approach to dependency, and 
dependency after avoidance of dependency 
were compared. In 20 of the 23 cases, the 
proportion of dependency following approach 
to dependency exceeded the proportion of de- 
pendency following avoidance of dependency 
The sign test 
significant, p= .001. Ir 


for this set of comparisons is 


1 18 of the 23 cases 
when approach followed dependency, further 
dependency then followed more than 50% of 
the time; and in 
then followed 
When 
avoidance, dependency never 
next patient behavior in 1 


in no case did dependency 


14 of the 23 cases, depend 
ency 
time. 


than 75% of the 
followed by 
occurred as the 


more 
dependency was 
of the cases; and 
avoidance 

of the time 
represented by 
units 
The mean proportion 
aggression 


follow 
of dependency as much as 5 

The other behavior system 
any substantial frequency of was ag 
gression (see Table 1) 
of instances when 


proach of 


followed ap 
a and that for 
aggression following avoidance of aggression 
02. In all 23 the proportion for 
aggression following approach of aggression 


ggression was .51, 


was cases 
exceeded that for aggression following avoid- 
ance of aggression. This difference, tested by 
the sign test, was, of course 
cant (p < .001). 

In interpreting the findings for Hypothesis 
I, it is 1 


Clearly signifi 


elevant to inquire whether remainet 
and terminator groups differed on total units, 
since there is a correlation between total num- 
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ber of units, and number of dependency units 
(yr = .79). There was no statistical support 
for the hypothesis that terminators and re- 
mainers differ in total number of units (¢ test 
of difference between means and Mann-Whit- 
ney U test both insignificant). Neither is 
there any evidence that the groups differ in 
total number of dependency units initiated 
(units which are not preceded by reinforce- 
ment of a prior dependency unit). 

Other general issues require attention. There 
is the matter of the total approach to total 
avoidance ratio for all types of patient be- 
havior. Terminator and remainer groups were 
compared on this measure, and there is no 
evidence of a difference. Similarly, the groups 
were compared on total approach, total avoid- 
ance, and the psychotherapist approach-avoid- 
ance ratio for instances of expressions of hos- 
tility by the patient. There is no statistical 
evidence of differences between groups on any 
of these measures. 


DISCUSSION 
All findings in this study must be viewed as 


preliminary because of the small number of 


cases. Furthermore, generalization of the find- 


ings to psychotherapists in general, and to 
patients in general would be unjustified. The 
study does offer a point of departure for fur- 
ther analyses of psychotherapy 

Hypothesis I. lf expressions of dependency 
on the part of the patient are approached by 
the psychotherapist, the patient tends to re- 
main in treatment. If, on the other hand 
there is little positive or even negative rein- 
forcement of dependency, then the patient 
will tend to avoid the treatment relationship 
This hypothesis is supported by the data, 
particularly when the psychotherapist is the 
object of the patient’s dependency. Since there 
was no evidence that the terminator and re- 
mainer groups differ on total units of behav- 
ior, total dependency units, total approach ex- 
perienced, total avoidance experienced, total 
approach-avoidance ratio, and psychothera- 
pists’ management of hostility, the inference 
to be drawn is that management of expres- 
sions of dependency is the critical feature re 
lated to remaining in treatment 

Should this two of the 
possible explanatory factors to be 


be correct, major 


assessed 
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would be patient characteristics and psycho- 
therapist characteristics. Considering the lat- 
ter possibility, it is relevant to note that three 
of the terminator cases were treated by psy- 
chotherapists who did not have any of the 
remainer cases. But, three of the terminator 
cases had psychotherapists who did have re- 
mainer cases. If all terminator cases had been 
treated by psychotherapists who had no re- 
mainer cases, the plausibility of an explana- 
tion in terms of psychotherapist character- 
istics would be higher. That three psycho- 
therapists had both terminator and remainer 
cases casts doubt on this as the only factor 
involved. In future work with larger sam- 
ples, approaches such as that employed by 
Bandura, Lipsher, and Miller (1960) should 
be utilized in clarifying the importance of 
such characteristics as dependency-anxiety of 
the psychotherapist 

There is one bit of indirect evidence sug- 
gesting that attached 
to patient characteristics in this process of 
expression and reinforcement of dependency 
Two expert psychotherapists listened, inde- 
pendently, to the recording of the interviews 
of five terminator and five remainer 
These were ranked for over-all 
the case, assuming that general, eclectic out- 
patient psychotherapy principles were fol- 
lowed by the psychotherapists. The rho be- 
tween the rankings by these judges was .87 
(p = .01). Four of the five terminator cases 
were among the five ranked as most 
difficult on the basis of combined ranks of 
the judges. One plausible inference would be 
that the cases make dependency bids which 
were deemed unapproachable in view of usual 
psychotherapeutic concepts. 

Obviously, an explanation ascribing impor- 
tance to patient variables, psychotherapist 
variables, and to interaction effects may well 
result from further investigation. In particu- 
lar, Munson (1960) has provided evidence 
which is relevant, here. 

Hypothesis II. If expressions of depend- 
ency are approached or positively reinforced 
by the psychotherapist, the frequency of such 
expressions will be sustained or will increase; 
and if expressions are avoided or negatively 
reinforced, their frequency will decrease. This 
hypothesis is clearly supported by the data. 


some importance be 


cases 


difficulty of 


cases 
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The results are analogous to those obtained 
by Bandura et al. (1960) regarding hostility. 
Results from the present study also support 
the hypothesis in regard to hostility. But, one 
must keep in mind that it is the initial and 
short term effects of positive reinforcement 
which are studied here, and that the same is 
true of so-called verbal conditioning studies 
It may be that in the long run, repeated ap- 
proach on the part of the psychotherapist 
might have rather different effects, if indeed 
psychotherapists do continue to respond with 
approach. If approach were continued at a 
high relative rate, patients may come to per- 
ceive the psychotherapist as controlling; 
simply become submissive and progressively 
more expressive of dependency, aggression 
and other reinforced categories; may become 
anxious or, in line with the catharsis hypothe- 
sis, there may be reduced expression regard- 
ing topics with repeated ventilation. 


may 


SUMMARY 


Two hypotheses were studied. First, if dur- 
ing the initial phase of treatment expressions 
of dependency need on the part of the pa- 
tient are approached more often than avoided 
by the psychotherapist, the patient will re- 
main in treatment; conversely, if there is in- 
frequent approach as compared with avoid- 
ance of expressions of dependency, the pa- 


AHMAD, BANDURA, AND RAvU 


tient will avoid the treatment relationship. 
Second, if expressions of dependency are ap- 
proached by the psychotherapist, the fre- 
quency of such expressions will be sustained 
or will increase; but, if expressions are 
avoided, their frequency will decrease. Both 
hypotheses were supported. The second hy- 
pothesis, as applied to expressions of hostility, 
was also confirmed. 

Further investigation that the 
form of the expressions of dependency by 
terminators and remainers may differ. It is 
suggested that some forms of expression may 
provoke avoidance if the psychotherapist fol 
lows conventional notions about proper psy- 
chotherapeutic technique 


suggested 
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THE VERBAI 
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In the past several years there has been 

increase in the number of attempts to 
study the process of psychotherapy objec- 
tively. Much of this pertinent literature has 
been surveyed in two articles, those by Auld 
and Murray (1955) and Frank (1961). The 
present investigation’ is an extension of an 
approach originally used by Snyder (1945) 
to study the verbal content of therapeuti 
sessions. The assumptions justifying these ob- 
jective studies, by now validated by the lit- 
erature, is that there is consistency between 
the verbalized material of each of the parties 
in the therapeutic relationship. The goal of 
the present investigation was to extend knowl- 
edge of the nature of these consistencies. Ulti- 
mately, the purpose of such studies of the 
therapeutic process should be to help facili- 
tate the progress of psychotherapy. 


an 


PROCEDURE 


Forty subjects were used in thi 
included patients at a 
hygiene clini 
sity and 


These 


mental! 


experiment 
iffiliated 
local 


agencies. The avers 


university 


and volunteers from the univer 


government fe inte 
1 This investigation is part 
therapeutic process done at i 
by A. Belden (1956), J Fogel 
1956), and W. Lyon (1956 
Anders Sweetland, then 
lraining Program 
2 The methodological differences bet 


the 
Director of 


under 


supe 
ul 


the 


rvision 


Clinical 


ween this study 
and that by Snyder are worthy of some clarification 
For example, Snyder used the 


and 


data from six clients 


four counselors; the present study included the 
clients and 4 counselors 
that 

a 


recordings as wel 


a trom 40 


] 
icluded 


Snyder’s 


material was gleaned from ver 
electrical 


as stenographically re 
corded notes 


made by a hidden observer listening 


through a public address system, whereas all of the 
material in the present study comes from electrically 
recorded interviews. The interviews that 
varied for each case; case analysis included the 
first 10 interviews, another, the first 13, the other 
two apparently ranged between 5 and 13 interviews, 
this is not made explicit, for a total of 48 hours. The 
present data include only the first 4 hours from the 
40 subjects, hence a total of 160 hours. 


Snyder 


used 
one 


PROCESS OF 


PSYCHOTHERAPY: 


INTERACTION 


ANDERS SWEETLAND 


RAND Corporation 


gence of the subjects was estimated from the V« 
cabulary the Wechsler-Bellevue and wa 
found to be 115, with none below an estimated IQ 
of 100. The age range of the subjects was from 1 
to 35 years. There were 19 males and 21 femal 
No individuals with a diagnosis of psychosis wert 
included. 

The interviewing was done by four people, all of 
whom had completed their academic 
leading to a 


score of 


requirements 
*hD in psychology with specialization 
in the area of clinical. In addition, all 
mum of 1 year’s experience in 


had a mini 
a clinical setting. In 
broad terms, the interviewing approach was Rog 
rian, the major exception being the “Forcing In 
sight” techniques.2 Each of the four 
interviewed 10 subjects 

The first 4 hours of the interviews were tape-ri 
corded yielding a total of 160 hours to be analyzed 
The data on the tapes were then coded using a modi- 
fication of the Snyder (1945) categorization. The 


procedure ol 


experimenter 


analysis was as follows: In 
t juence: 

1. The client 
statements 


makes a statement or a series of 
2. The therapist then makes the response that he 
deems most appropriate. 
The client then makes a statement or a seri 
of statements. 


For example: 


Client: I’m afraid to get married. 

Therapist: I wonder what causes these feelings? 

Client: I know I would like to be married, but 
I’m afraid. I think my parents’ quarreling all the 
time made me afraid 
would be like that. 


I’m afraid that my marriage 


This particular sequence would be scored: 
Client: Unpleasant Affect ir 
Therapist: Forcing Insight I 


Client: Unpleasant Affect in the Present, Tenta 
tive hypothesis, Unpleasant Affect in the Present 


the Present 


3 The word “forcing” is a misnomer; 


gentle prod- 
ding would be more appropriate 

4 The categories used by Snyder were modified to 
provide for more specificity and breadth in the cate- 
gorization of both the client as well as the therapist 
data. For example, though Snyder analyzed the af- 
fective tone of the client’s statements, he did this 
separately from the analysis of the “content,” herein, 
the client’s statements reflecting affect were consid- 
ered part of the total number of possible client re- 
sponses to the therapist’s statements. 
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TABLE 1 


TABLE 


oF Cut SQUARES REPRESENTING AN ANALYSIS OF THE INTERACTION 


BETWEEN THERAPIST AND CLIENT STATEMENTS 


Talk 


about 


statements 


ot 
Therapist statements Problems Sy 


Direct Question +110 —13 

ziving Information 0 0 

0 +4 
+34 
= Acceptance +1 
Forcing Initiative -30 
Nondirective Leads +14 
Reflection —1 
Clarification —49 
Forcing Topic N +7 
Forcing Topic D 0 
Forcing Insight I —63 
Forcing Insight II —57 
Interpretation —204 


—14 
+17 


Note.—The + or — refers to direction of obser 


The total chi square was 35408; for df = 104, a chi square 


The test of the quality of the relationship between 
the various client-therapist sequences was made by 
the method of chi square. Before proceeding with 
the investigation proper, however, considerable atten 
tion was given to the reliability of scoring.® 

5 The data regarding the reliability of scoring was 
gleaned from Fogel (1957). Thirteen judges of vary- 
ing degrees of psychological sophistication and ac- 
quaintance with these data were used in originally 
assessing reliability. Reliability was assessed sepa- 
rately with regard to client and therapist categories 
and whether the information was being obtained 
from verbatim typescripts or the tape recordings 
themselves. The material of judges found in Fogel’s 
original data who were found to be scoring sig- 
nificantly more or less statements than the rest of 
the judges and in comparison with the master judge 
(a composite scoring arrived at through agreement 
between the four senior research workers on the 
project), were omitted from the calculation of the 
percentage-of-agreement scores. It was felt that their 
scoring deviated so much beyond expectancy that 
inclusion of their material would artificially skew the 
reliability findings. The results of the tests of reli- 
ability demonstrate that with regard to the client 
categories from typescript: the average percentage-of- 
agreement of each judge with every other judge (in- 
terjudge) was 73% (range: 60-87%); the average 
agreement of each judge with the master scoring 
(judge-master) was 70% (range: 60-81%). With re- 
gard to the client categories from the tape record- 
ings: the average percentage of interjudge agreement 
was 70% (range: 55-86%); average judge-master 
agreement was 73% (range: 57-97%). With regard 
to the statements made by the therapists, from type- 
script: the average interjudge agreement was 69% 
(range: 63-76%); average judge-master agreement 
was 70% (range: 61-78%); from recordings: the 
average interjudge agreement was 70% (range: 61- 


Statements Pleasant ant ant 
Reflecting Affect in 
nptomsUncertainty Present 


ved frequency fr 
ot 61.9 


Client statements 
Unpleas- Unpleas- Under 
standing 
and In 
sight 


Affect in 
Present 


Affect in 
Past 


Tentative 
Striving 


3 —32 —i2 —34 —153 
—5 0 + +3 35 
+20 0 0 
+35 —2 13 1 -14 
+4 +1 +4 
—1 t 2 
all +8 
+4 
—17 ' +1 
0 -1 
0 +11 
—2 —6 —3 
1 2 +1 
0 +18 -2 


ym expected frequency ; chi squ 
is significant at the .01 level of « 


RESULTS 


The data of the chi square analysis are 
presented in Table 1. The total chi square 
(significant well beyond the .001 level of con- 
fidence) shows that a null hypothesis is un- 
tenable; the frequency counts must be at- 
tributed to something other than chance. 
More specifically, the responses made by the 
therapist definitely influenced the responses 
made by the client. In light of the explora- 
tory nature of the study, an arbitrarily high 
chi square of 100 was selected for sorting out 
the “interesting” cells, i.e., ceils which con- 
tributed most to the overall chi square. 

Table 1 has six rows containing interesting 
cells. Three rows are the “Forcing” categories, 
viz., Forcing Insight I and II and Forcing 
the Topic of Discussion. The remaining three 
rows are: Direct Questioning, Clarification of 
Feeling, and Interpretation. These are dis- 
cussed in order of decreasing contribution to 
the chi square. 

Forcing Insight I was the heaviest con- 
tributor to the chi square. With such a cate- 
gory the therapist focuses on a particular 
cause-and-effect sequence. He offers the client 
either the cause or the effect and asks for the 
missing element. Characteristic questions are: 


94%); average judge-master agreement was 89% 
(range: 83-95%). These findings compare favorably 
with those by Snyder (1945); for a more extensive 
treatment of the reliability data see Fogel (1957). 





VERBAL INTERACTION 
“What do you think made you so disturbed 
just at that time?” “How did this effect your 
social relationships?”, or the simple query 
after the client has expressed some sentiment, 
“I wonder why?” Inspection of the Forcing 
Insight row shows three cells contributing to 
most of the nonrandom element: the client’s 
statements of uncertainty (most frequently, 
“I don’t know,” following the therapist’s 
question, “I wonder why?”’), the client’s in- 
dication of Tentative Striving, and Under- 
standing and Insight. 

Of these three client statements, only the 
one relating to Tentative Striving warrants 
clarification. This category is used to de- 
scribe those responses implying that the client 
is thinking about taking a more active role 
or controlling responsibility in his life, in gen- 
eral, or in the therapeutic process, in particu- 
lar. The category includes both the timorous, 
‘Maybe I ought to ’ and the more posi 
tive, “I’m going to try ’ Usually, Tenta- 
tive Striving would be expected to be found 
in the later stages of the therapeutic process. 
These data suggest that it is possible to en- 
courage a desire on the part of the client in 
the early stages by the judicious use of leads 
by the therapist. It is interesting to note (see 
Table 1) that the only place where such ten- 
tative client formulation appeared in any 
great strength was following the Forcing In 
sight categories. 

Because Interpretation is considered to be 
outside the pale of nondirective therapy, the 
sequalae of the interpretations are particu- 
larly interesting. Interpretation was followed 
by an increase in Understanding and Insight 
and was followed by a decrease in Statements 
about the Problem. Both effects were well be- 
yond chance; our absence of comment is de- 
liberate. 

Clarification of Feeling has been the back- 
bone of nondirective therapy.® It is a magnifi- 
cent technique when done well (which is not 
easy). In a good clarification the therapist 
summarizes a complete area of confused emo 
tionality in one or two careful sentences. The 
client has the feeling of being completely un- 

6 “The primary technique [to promote the develop- 
ment of insight] is to encourage the expression of 
attitudes and feelings ... until insightful 
standing appears spontaneously” (Rogers, 
195). 


under- 
1942, p. 
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derstood: the impact is obvious. It is not sur- 
prising that Understanding and Insight re 
sults; this is the raison d’étre of the clarifi 
cation. 

Direct Questioning and Forcing the Topi 
(either nondirectively, as in “Would you like 
to tell me more about that?’’, or more direc- 
tively, “Tell me more about your mother.”’) 
can be discussed simultaneously. Both are 
highly directive in nature; the client is not 
giving much leeway in responding. Both re- 
sult in a marked decrease in the Understand- 
ing and Insight column. Direct Questioning, 
in addition, results in a marked increase in 
Statements about Problems. 

The last row to contain an interesting cell 
was Forcing Insight Il. Here, the client is 
given both elements of a cause-and-effect re- 
lationship and asked if he can determine the 
nature of the relationship: “Is there any re- 
lationship between your headaches and your 
fear of people?” It is one step short of inter- 
pretation. The difference between Forcing In- 
sight and Interpretation is in the motivation 
of the therapist, viz., that of trying to keep 
the initiative with the client. It is less than 
surprising that this lead results in a marked 
increase in Understanding and Insight. 

It is of interest to compare these results 
with those of previous investigations. On the 
one hand, these findings confirm those of 
Sherman (1945) with regard to the effect of 
the therapist’s clarifications and tentative for- 
mulations upon client insight. However, Berg- 
man (1951) and Snyder (1945) found that 
interpretations made by their therapists lead 
their clients away from insight and explora- 
tions; in Snyder’s data even clarification was 
not found to lead to insight. Bergman found 
that reflection, and Snyder, approval-and-en- 
couragement and simple acceptance of what 
the patient said, were followed by insightful 
statements. 

We can only conjecture about these dis- 
crepancies. One possibility lies in the manner 
in which we use the Forcing Insight cate 
gories. These are never used (particularly the 
first few trials) unless the therapist is reason- 
ably sure of getting some perception of in- 
sight on the part of the client. The wording, 
the phrasing, the inflection, and the timing 
are all carefully considered. The question the 
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therapist asks himself is, “How should I say 
it so he (the client) will see it?” Once this 
is determined, and if the client is prepared 
to explore in this area, the lead is offered. 

What must also be taken into account in 
the apparently discrepant findings are the 
several meanings attributed the same item in 
the categorization of the data by different 
writers, For example, Snyder (1945) used in- 
terpretation to mean, “Responses in which the 
counselor points out patterns and relation- 
ships in the material presented” (p. 199), as 
this was used in the present study. However, 
Bergman (1951) writes: 

. “interpretation” as used in the present study is 
synonymous with any counselor verbalization which 
conveys meaning not present in the client’s com- 
munication such as information or an opinion. This 


definition does not of necessity conform to the usage 
of other psychological writers (p. 223). 


A third factor in explanation of the appar- 
ently discrepant findings may be a function 
of the scoring procedure. It is conceivable 
that “insight” may be observed following cer- 
tain categories spuriously. For example, the 
client makes a statement expressing some as- 
pect of a problem; the therapist replies with, 
let us say, a clarification; the client then be- 
gins to explore areas about which he had here- 
tofore been unaware; the therapist encour- 
ages the exploration by nodding or grunting 
approvingly; the patient comes to an insight. 
In such a sequence, “insight” would be scored 
following “Mm-hm” or Approval and En- 
couragement, whereas in actuality it relates 
to some previous statement by the therapist 
such as the preceding clarification. 


SUMMARY 


Forty subjects were interviewed by four 
psychologists in a psychotherapeutic setting. 
The first 4 hours of the interviews were re- 
corded and categorized statement by state- 
ment. These data were then tested to deter- 
mine if the therapist’s statements had a se- 
lective influence on the client’s responses. The 
results showed such an influence. The first ob- 
servation was that both Clarification of Feel- 
ing and Forcing Insight had the same major 
effect on the client’s responses: both pro- 
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duced marked increase in the number of Un- 
derstanding and Insight responses (two to 
three times chance expectancy). The second 
observation was that Direct Question and In- 
terpretation were antithetic in their effects. 
Interpretation resulted in an increase in Un- 
derstanding and Insight and a decrease in 
Statements about Problems; the reverse was 
true of Direct Questions. Finally, the results 
reaffirm the conclusion drawn from previous 
research that there is a consistent relation- 
ship in the sequence of client-therapist state- 
ments, and along with the studies on the con- 
ditioning of verbal behavior, as reviewed by 
Krasner (1959), suggest that statements made 
by the client appear to be quite amenable to 
modification by the therapist, a finding that 
should have some implication for the endeav- 
ors directing their attention to the progress 
of therapy. 
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For more than 25 years psychologists have 
struggled with the problem of measuring intel- 
lectual deterioration due to mental illness. In 
reviewing tests of deterioration, Yates (1956) 
recommends longitudinal studies of deteriora- 
tion. “Clearly, the most satisfactory method 
from the point of view of reliability and va- 
lidity is by means of the test-retest method 
over a given period of time, using tests for 
which adequate statistical information is avail- 
able.” Since direct measures of the patients’ 
premorbid level of intelligence are usually 
not available, various cross-sectional methods 
based on the idea of differential deficit have 
been used to make indirect estimates of in- 
tellectual deterioration. This paper presents 
the results of a longitudinal study of cog- 
nitive deterioration in schizophrenia. 

Direct measures of intelligence before hos- 
pitalization (i.e., time of induction) are now 
available on all enlisted men in the U. S. 
Armed Forces. By retesting a patient as soon 
as possible after hospital admission, an esti- 
mate of cognitive deficit can be obtained. The 
present study uses these direct measures of 
cognitive deficit to answer the following ques- 
tions: 

1. What is the amount of cognitive deficit 
produced by a schizophrenic episode? 

2. Does a schizophrenic episode cause dif- 
ferential deficit? In particular, is a test in- 
volving vocabulary more resistant than other 
tests to the effects of schizophrenia? 

3. Is there a difference between paranoid 
schizophrenics and nonparanoid schizophren- 
ics in their premorbid intellectual level, or in 
the deficit produced by mental illness? 

4. Does cognitive deficit for schizophrenic 
patients differ in amount or pattern from the 
impairment shown by patients with recent 
diffuse brain injury? 


1 We are indebted to Murray Glanzer for his con- 
structive comments and technical assistance. 


PROCEDURI 
Measures 


This study 
tion Battery 


uses five tests of the Army Classifica- 
(ACB). These are multiple-choice pa- 
per-and-pencil tests with fixed time limits, given to 
all enlisted men when they enter the army. Each test 
has a standard mean of 100 and a standard 
deviation of 20. The ACB tests are used for predict 
Army 
age validity for this purpose being about - 
Zeidner, Harper, Johnson, & Uhlaner, 1954; 
1952) 

1. Reading and Vocabulary 
tains a number of 


score 


ing achievement at various schools, their aver 


(Fuchs 
Uhlaner 
The five tests are the following 

(RV): This test 
reading passages of one or 
paragraphs. Items are based on the content of 
paragraph. 


. 


Arithmetic Reasoning 
arithmetic 
»ome items are 


or graphs 


Test (AR): In this test 
problem presented ver- 
based on dials, tables 


each item is an 


bally scales, 
3. Pattern Analysis Test (PA): In this test a two 
dimensional pattern with numbered lines is presented 
along with the corresponding three dimensional fig 
ure made by folding the pattern along the indicated 
lines. The edges of the figure are lettered. The sub- 
ject is required to identify the lettered edge of the 
figure corresponding to a numbered line in the 
pattern. 

4. Mechanical Aptitude Test (MA): In this test 
each item consists of a figure illustrating some physi 
cal principle. The subject is to identify the correct 
principle 
The test 
speeded and administered 
limits. In Part I, Number Re- 
versal, each item consists of two numbers. The sub 
ject is to indicate whether or not the second num- 
ber is exactly the reverse of the first. In Part II, 
Coding, there is a key containing 10 words. Each 
word in the key has a associated with it 
For each item the subject matches the correct num 
ber with a given word 


5. Army Clerical Speed (ACS) 
of two parts, both highly 
with time 


consists 


separate 


number 


Montague, Williams, Lubin, and Gieseking (1957) 
found that RV, AR, and PA each have correla- 
tions of about .80 with the Wechsler-Bellevue total 
weighted score. A more detailed analysis of the ACB 
tests is given in the Montague paper. Since the ad- 
ministration of the five tests requires about 24 hours, 
it was usually necessary to have two separate testing 
sessions for the patients. 
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Subjects 


The ACB tests were readministered to four groups 
of Army enlisted men: (a) 78 patients with paranoid 
schizophrenia, (b) 81 schizophrenics without pro- 
nounced paranoid symptoms, (c) 64 brain injured 
patients, and (d) 162 control subjects, ie., enlisted 
men with no history of brain injury or mental ill- 
ness. All patients were tested within 1 to 3 months 
after admission to Walter Reed General Hospital in 
1957 and 1958. 

The subjects were tested in groups of 1 to 10 so 
that the examiners could answer questions and moni- 
tor individual performance. The age range for the 
four groups was 18 to 51, with a mean of 26 and a 
standard deviation of 7.3. The mean ages of the 
groups did not differ significantly. The retest inter- 
val for the four groups ranged from 1 to 140 months, 
with a mean of 24.5 and a standard deviation of 


TABLE 1 


DIAGNOSTIC BREAKDOWN OF SCHIZOPHRENIC 
PATIENTS BY SUBTYPE 


Group Number 


Nonparanoid 81 
Unclassified 
Simple 
Catatonic 
Latent 
Mixed 
Schizo-affective 


Paranoid 


Total 


25.2. There were no significant differences in mean 
retest interval between the groups. 

Table 1 shows the frequency of various diagnoses 
in the schizophrenic groups. These diagnoses were 
made by the senior staff of the psychiatry service at 
the end of the patient’s period of hospitalization. A 
description of the controls and brain injured pa- 
tients will be found in Williams, Lubin, and Gieseking 
(1959). 


RESULTS 

Deficit scores were computed by subtract- 
ing the Army entry score from the score ob- 
tained at hospital entry. 

Figure 1 summarizes the pattern of deficit 
scores for the control, brain injured and com- 
bined schizophrenic groups. The control sub- 
jects show a general increase in test scores on 
the second testing. The two clinical groups 
show a general decrease in scores, and similar 
patterns of deficit. 
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ARMY CLASSIFICATION BATTERY TESTS 
Fic. 1. Cognitive change on the Army Classification 
Battery for patients and controls. 


Figure 2 shows that the deficit pattern for 
paranoid is similar to that of the nonparanoid 
schizophrenics. A test of the differences be- 
tween profiles using a linear discriminating 
function showed that the paranoid and non- 
paranoid groups did not differ significantly on 








ARMY CLASSIFICATION BATTERY TESTS 


Fic. 2. Cognitive change on the Army Classifica- 
tion Battery for paranoid and nonparanoid schizo- 
phrenics. 





CoGNITIVE DEFICI 


rABLE 


T IN SCHIZOPHRENIA 


) 


Descriptive STATiIsTicSs ON Army Entry ACB Scores ror Controis, PARANOIDS, 
AND NONPARANOID SCHIZOPHRENICS 


™ 
78 Paranoids 


WU SD 
102.18 
92.06 
95.63 
96.46 
ACS 85.74 


Sum 472.07 





amount or pattern of deficit. Therefore, with 
regard to the five ACB tests, we can refer 
to a common “schizophrenic” pattern of cog- 
nitive deficit. Table 2 shows that the para- 
noid schizophrenics tended to have higher pre- 
hospital scores than the nonparanoid schizo- 
phrenics. The differences, however, were not 
significant. 

Since many of the schizophrenic patients 
were on medication at the time of the ACB 
test, an analysis of the effects of medication 
was made. In the sample of 159 schizophren- 
ics, 35% were on chlorpromazine, 10% were 
on mixed medication and were on no 
medication. There were no significant differ- 
ences on Army entry scores between the group 
selected for medication and the group on no 
medication (¢ test). The effects of medica- 
tion, dosage of chlorpromazine, etc. on the 


eco 
BP by /, 


446.800 


ores at Army entry 


162 Controls 


81 Nonparanoids 


Tf Uf SD 


SD 


95.89 
87.33 
93.09 
88.59 
81.96 


97.90 
93.18 
94.20 
96.43 
86.90 


20.34 
20.70 
21.11 
17.76 
19.25 


468.610 


deficit scores were tested by analysis of vari- 
ance. No significant differences were found 
It is assumed, therefore, that no correction 
need be made in this sample for the effect of 
drugs. 

Table 3 gives the summary statistics on 
cognitive deficit for the two schizophrenic 
groups. Similar statistics for the brain injured 
and control groups are given in Williams et al. 
(1959). For the schizophrenics, all the ACB 
tests except the Pattern Analysis test show a 
significant drop from premorbid level. To 
make an exact test of the null hypothesis that 
the average drop in the schizophrenic group 
is equal on all five ACB tests, C. R. Rao’s 
(1952) adaptation of Hotelling’s T? (1931) 
was computed (Anderson, 1958, pp. 110- 
112). The result was significant at the .001 
level. In general, therefore, the nonspatial 


TABLE 3 


DerFicit SCORE FOR PARANOID 


AND NONPARANOID SCHIZOPHRENK( 


PATIENTS ON Five ACB Tests 


78 Paranoids 


Sp 


AR 


PA 
MA 


D* 


* D is the sum of the deficit s« 
* Significantly different fro 


r all five ACB tests 


ro at the 


01 level or better 


81 Nonparanoids 159 Schizophr« 


SD V 


one-tail ¢ test. 
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tests (RV, AR, and ACS) appear to drop 
more after a schizophrenic episode than the 
spatial tests (PA and MA). The linear com- 
bination of ACB deficit scores that maximizes 
the difference between scores before and after 
hospitalization was computed. This optimal 
linear discriminating function was 50 RV + 
27 ACS + 10 AR — 10 MA — 73 PA, using 
an arbitrary weight of 10 for AR. These 
weights are in good agreement with the deficit 
pattern shown in Figure 1; in general, the 
nonspatial deficit scores (RV, ACS, AR) 
minus the spatial deficit scores (PA, MA) 
form the best linear function for detecting 
cognitive loss in the schizophrenic group. It 
should be noted that the most clearly verbal 
test, Reading and Vocabulary, has one of the 
highest positive weights. 


Comparison of Schizophrenics with Controls 


The summed deficit score, D, has a signifi- 
cant point biserial correlation of .35 with the 
criterion score obtained by assigning zero to 
all schizophrenics and unity to all controls. 
The linear discriminating function has a cor- 
relation of .38 which does not differ signifi- 
cantly from the .35 obtained just by adding 
the five deficit scores together. If an optimally 
weighted linear combination of the deficit 
scores RV and ACS is used, the correlation 
is .37. These results indicate that schizophren- 
ics not only show significantly more cognitive 
deficit than controls, but that this differentia- 
tion is particularly dependent on the Read- 
ing and Vocabulary and Army Clerical Speed 
tests. 


Comparison of Schizophrenic and Brain In- 
jured Patients 


As seen in Figure 2, the schizophrenic pa- 
tients have less overall impairment than the 
brain injured patients, but the general pat- 
tern of deficit is similar; the more verbal tests 
(RV, AR, and ACS) show more loss than the 
spatial tests (MA and PA). 

Statistical analyses were made to answer 
two questions: (a) Do the deficit scores for 
the schizophrenic group differ from those of 
the brain injured sample? (6) Can deficit 
scores be used to differentiate schizophrenics 
from the brain injured? The sum of the five 
ACB deficit scores, D, has a significant point- 
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biserial correlation of .31 with the dichotomy, 
schizophrenic vs. brain injured. A linear dis- 
criminating function using the optimal weights 
for the five deficit scores did not increase this 
correlation significantly. These results indicate 
that the brain injured patients suffer more 
cognitive deficit than the schizophrenics, but 
that the pattern of deficit for the two groups 
is essentially the same. Thus, the results are 
in agreement with the inferences previously 
drawn from Figure 2. The difference between 
the two groups has little value for individual 
diagnosis. Although, on the average, brain in- 
jured patients show more deficit, the maxi- 
mum likelihood cut-off point on the D score 
produces only 70% correct classification. The 
point of equal misclassification (see Williams 
et al., 1959, p. 303) yields only 67% correct 
classification. 
DISCUSSION 


The four questions posed in the introduc- 
tion can be answered as follows: 

1. What is the amount of cognitive deficit 
produced by a schizophrenic episode? 

The schizophrenic patients lose one-third 
to one-sixth of a standard deviation on RV, 
ACS, AR, and MA with PA remaining essen- 
tially unchanged. 

Longitudinal studies in which the same in- 
telligence tests were administered to schizo 
phrenics both before and after hospitalization 
are rare. The only other such study discov- 
ered in the literature was one by Rappaport 
and Webb (1950). They tested 10 schizo- 
phrenic patients, using intelligence tests that 
the patients had taken prior to admission to 
the hospital (during high school). On the av- 
erage, they found a deficit of 33 IQ points, 
with 9 out of their 10 patients showing large 
drops from their prehospital score. In terms 
of standard score units, this deficit exceeds 
that of our ACB study. However, half of their 
patients had been hospitalized for 16 to 81 
months, whereas all of the patients in this 
study were tested within 3 months of hospital 
admission. Presumably chronic schizophrenic 
patients with long hospitalization show more 
deterioration than 
ized patients. 


acute, recently hospital- 


2. Does a schizophrenic episode cause dif- 
ferential deficit? 





Cocnitive DEeErict1 
The schizophrenic patients show the great- 
est deterioration on RV, ACS, and AR, the 
more verbal or nonspatial tests. The pattern 
may surprise many clinical psychologists who 
have accepted the hypothesis that tests which 
involve vocabulary are the best measures of 
premorbid intelligence, and show the smallest 
drop with illness. However, as was pointed 
out in a previous article on direct measure- 
ment of cognitive deficit due to brain injury 
(Williams et al., 1959), the evidence for the 
hypothesis is not very convincing. 

The first quantitative evidence that verbal 
tests stand up best during schizophrenic de- 
terioration by H. Babcock 
(1930). Shapiro and Nelson (1955) tested 
this hypothesis and did not find support for 
it. They found that their psychiatric groups, 
including schizophrenics, 


was presented 


showed marked de- 
terioration on the Terman-Merrill Vocabulary 
Test. 
3. Is there a difference between paranoid 
and nonparanoid schizophrenic patients? 
The results indicate that the two groups do 
not differ significantly in their prehospital in- 


tellectual level, or in the amount and pattern 


of deficit, although the paranoids tend to 
show better scores. 

4. Is there a difference between the deficits 
found in schizophrenic and brain injured pa- 
tients? 
signifi- 
than the 
patterns of 
deficit are similar. The reasons for this com- 
mon pattern of deficit remain to be discovered. 


The brain injured patients show 
cantly greater amounts of deficit 


schizophrenic patients, but the 


SUMMARY 


Subjects, who were tested on the Army 
Classification Battery when they entered the 
Army and after they 
schizophrenia, showed a drop of one-sixth to 


The pat- 


were hospitalized for 


one-third of a standard deviation 
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tern of deficit was similar to that of brain 
injured patients; Pattern Analysis and Me 
chanical Aptitude show less deterioration than 
Reading and Vocabulary, Army Clerical Speed 
and Arithmetical Reasoning. The amount of 
deficit was considerably less than that showed 
by brain injured patients. The deficit scores 
could not be 
brain injury 
best cutting 
classification. 


used for differential diagnosis of 
from the 


schizophrenia, since 


only 7 


score shows correct 
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RIGIDITY AND CONFORMITY TENDENCIES OF JUDGES AND 
THEIR UTILIZATION OF AUTOBIOGRAPHICAL 
MATERIAL IN MAKING PREDICTIONS’ 


ZANWIL SPERBER 2 


Children’s Hospital of Philadelphia, Pennsylvania 


Interest in the relationship between rigidity 
and clinical judgments was stimulated by a 
study carried out by Dailey (1952) who 
showed that premature judgments made about 
another person “freeze” a judge’s opinion, 
and interfere with his ability to utilize addi- 
tional relevant information presented later to 
revise or improve his judgments. In Dailey’s 
research, as in the present study, the clinical 
judgments were made in the form of predic- 
tions of how another person had described 
himself. 

Hypotheses which follow from Dailey’s find- 
ings, and which the present research set out 
to test are: 

1. Judges who are more rigid will show the 
tendency to fixate on their premature judg- 
ments more strongly than judges who are less 
rigid. 

2. Judges who are more rigid will be less 
able to utilize additional relevant informa- 
tion presented subsequently to improve their 
judgments, than judges with less rigid per- 
sonalities. 

Underlying the hypotheses of this study is 
the assumption that the concept of rigidity 
refers to a personality trait which is suffi- 
ciently central to affect different classes of be- 
havior.* A personality trait refers to a sub- 
ject’s readiness to respond to a class of stimuli 
with a characteristic type or quality of be- 
havior. For the subject’s personality traits to 
be expressed in overt behavior the disposi- 

1The UCLA portions of the study were carried 
out as part of a research program studying “The 
Utilization of Internal and External Cues in Inter- 
personal Perception” supported by a grant from the 
University of California Faculty Research Commit 
tee. Ernest Rosencrans’ and David Raskin’s contribu- 
tions as research assistants are gratefully acknowl- 
edged. 

2 Now at Mount Sinai Hospital, Los Angeles. 

3 The writer’s definition of central and peripheral 
traits is similar to that given by Sanford (1956, p. 


352). 


tional state must be activated. This will occur 
only if the subject becomes ego-involved in 
the situation.‘ 

To be capable of arousing an adequate de- 
gree of ego-involvement we believe the be- 
havioral situation must: (a) mobilize the sub- 
ject’s interest in participating, (6) be per- 
ceived by the subject as pertinent to his 
needs and aspirations, (c) present tasks to 
be performed which are plausibly related to 
the particular traits, motives, or aspirations 
aroused. 

METHOD 


The study was initially carried out in the experi- 
menter’s University of Michigan Extension class in 
Personality, with student subjects. Two replications 
followed. In Study 2 the subjects were students 
in the experimenter’s Abnormal Psychology class at 
UCLA. In Study 3 the subjects in the experimenter’s 
UCLA Extension course in Abnormal Psychology 
All data were collected using unsigned, coded answer 
sheets. The data for the three groups will be pre- 
sented separately. 

As measures of personality rigidity, the California 
F Scale (Adorno, Frenkel-Brunswik, Levinson, and 
Sanford (1950) and Wesley’s (1953) Rigidity (R) 
Scale were administered in class early in the semester. 
The 28 item F Scale was presented in a true-false 
version to the Michigan group, and in the usual six- 
choice “strongly agree-to-strongly disagree” version 
to the other groups. Table 1 shows the means, stand- 
ard deviations, and intercorrelations of the F and R 
scores. The two rigidity measures correlated signifi- 
cantly but also seemed to get at different things. In- 
spection suggests that the R Scale places much more 
emphasis on items tapping perseveration and per- 
sistence, while the F Scale contains many morally 
toned items focused on duty, control of sexual im- 
pulses, and denial of feelings of weakness. 

The experimental task was presented near the end 
of the semester as a trial of a promising method for 
testing what students learn in psychology courses 


4 Vogel, Raymond, and Lazarus (1959) have made 
the same point with reference to the role of motiva- 
tion in determining whether a stressor state occurs. 
They say, “We propose that any typical stressor 
situation will produce a state of stress only if it 
threatens important ego-motivations of the S” (p. 


225). 
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TABLE 1 


ANS, STANDARD DEVIATIONS, AND INTERCORRELAT 
AND WESLEY’S 


R ScaLe Sci 


F Scale* 


ject VW 


Mix hig in 


Study 3 


1950. 


devoted to human functioning. The experimenter en- 
couraged the subjects to study and expressed a strong 
belief in the ultimate adoption of this type of test, 
but also told the subjects their 
count towards their course gradi 

The subjects read part of an autobiography, one 
of the sets of stimulus material previously used by 
Dailey (1952) and originally written by a beginning 
Kelly and Fiske’s 

The test required the subjects to 
60 items (Michigan group) or 100 
STDCR (Guilford, 


score would not 


clinical psychologist assessed in 
(1951) VA Study 
fill out the first 
items (UCLA groups)® of the 


5 The writer wishes to thank E. Lowell Kelly for 
his permission to use material from the files of the 
VA Project, and Charles A. Dailey for his 
tion in making available and discussing the 
teristics of the autobiographical material 
research 

6 To 


test” we 


coopera- 
charac- 
use in his 


maximize the reliability of the “prediction 
wanted it to contain as many items as the 


SD 


1940) “ as you think the whose auto- 
biography you just read filled it out Your suc 
cess in using the information you got from the auto 


biography will be a measure of the ig 


person 


knowledge you 
have gained in this course.” 

The STDCR had 
ussessees in a dichotomous yes 
subjects were directed to use only those 
Since the particular VA assessee used in the 
study did the ? alternative on 
analyses of the subjects’ 


been administ 


categories 
present 


choose two items 


accuracy of predictions will 
be based on data from 58 items (Study 1) or 98 
items (Studies 2 and 3). After filling out the STDCR, 
response sheets were collected and they 


were given the second half of the autobiography re 


the subjec ts’ 


subjects could read, comprehend and answer in the 
Michigan 
periorm 


choice of 


time available. Prior observation of the 
and UCLA subjects reading speed and test 
ance on regular exams dictated thx 


test lengt 


class 


rABLE 2 


ON READING 


HALF Vs. THE 


First Prediction* 


subject group 


Study 1 
Study 2 
Study 3 


Michigan 
UCLA 
UCLA 


based on a 58-ite 
the larger N per 
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taining their copy of the first half. They then re- 
peated the cross prediction procedure “on the basis 
of all the material now available.” As shown in 
Table 2, for all three groups the mean accuracy of 
the second set of predictions was higher than the 
first set, significantly so for the UCLA subjects. This 
suggests that the additional autobiographical material 
did provide new information relevant to the predic- 
tion task, although practice may 
tributed to the improvement. 


also have con 

We assume the subjects’ continued attendance in 
an elective course indicated interest in the subject 
and motivation to achieve mastery of the material 
By relating the experimental procedures to the con- 
tent and purposes of the course we attempted to 
take advantage of the subject’s interest in partici 
pating and make the procedure motivationally rele- 
vant. Since the students had been exposed to the 
idea that the ability to predict is an important proof 
of the worth of a theory we hoped the “prediction 
test” would have “face plausibility.” 

The Michigan subjects (Study 1) and most of the 
UCLA extension subjects (Study 3) did the entire 
procedure in the 24 hour class session. The Study 2 
subjects, with only 50 minutes of class time avail- 
able, were asked to do the second part at home and 
bring it back to the next class session. For Studies 1, 
2, and 3, respectively, 86%, 59%, and 75% of the 
subjects who took the personality test early in the 
semester contributed both sets of prediction data. 
Some of the loss in all groups was a function of a 
chance absence on a data-collection day, or of nor- 
mal drop-outs. 

At the end of the semester the experimenter rou- 
tinely asked his UCLA students to fill out a 15-item 
Instructor Rating Form (IRF) on which they could 
anonymously record judgments about various aspects 
of the course and teaching. The last item elicited a 
summary judgment. To explore the ideas stimulated 
by the pattern of results in Studies 1 and 2, the sub- 
jects in Study 3 filled out the IRF on coded sheets. 
Only the response to Item 15 will be discussed in 
detail. 


SPERBER 


rABLE 3 


CORRELATIONS BETWEEN RIGIDITY SCALé£ 
NuMBER OF ITEMS JUDGED 
Tests 1 AND 2 


SCORES AND 


THE SAME ON 


Rigidity 


Subject group Scale 


Mic higan 


Study 1 


RESULTS 


To test the hypothesis that there would be 
a positive relationship between rigidity and 
the tendency to fixate on initial judgments, 
product-moment correlations were calculated 
between the subjects’ scores on the F and R 
Scales and the number of items they answered 
the same way on Tests 1 and 2. The results 
are presented in Table 3. Contrary to ex- 
pectation all six correlations are negative, 
reaching an acceptable level of statistical sig- 
nificance in Study 1. 

We also hypothesized that there would be 
an inverse relationship between the subjects’ 
rigidity and their ability to use new informa- 
tion to improve their judgments. To test this 
hypothesis, product-moment correlations were 
calculated between the subjects’ scores on the 
F and R Scales and the degree to which their 
accuracy of prediction on Test 2 improved 


TABLE 4 


CORRELATIONS BETWEEN RIGIDITY 


Rigidity 


Subject group Scale 


Michigan—Study 1 


UCLA—Study 2 
UCLA 


Study 


» The 


* » 


subject's ac 
< .O5. 


SCALE 


SCORES Acct 


AND 
Accuracy 


Change* 


1? 

07 
»7** 

27 


05 
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over the accuracy of their prediction on Test 
1. The relevant data are presented in Table 4, 
in the column labeled Accuracy Change. As 
predicted, all six correlations are 
and one is statistically significant. 

The subjects’ scores on the F and R Scales 
were also related, separately, to the accuracy 
of their predictions on Test 1 and on Test 2 
These product-moment correlations can be 
seen in Table 4 in the Accuracy columns. The 
overall tendency for the relationship between 
subjects’ rigidity and judgmental accuracy to 


negative, 


be either less positive or more strongly nega- 
tive for the second set of judgments vs. the 
first set is compatible with the hypothesis. 


DISCUSSION 
The results of all three studies force us to 
reject the first hypothesis 
high on the F and R Scales tended to change 
their predictions more often than less rigid 


judges. 


Subjects scoring 


In speculating about this paradoxical find 
ing the following thoughts occurred. An im 
portant characteristic of the rigid personality, 
as described by Adorno et al. (1950, pp. 230 
234), 
cal way to the demands of an authority figure 
Perhaps implicit in the experimental sequence 
was the suggestion that the subjects should 
change their judgments. Block and Block’s 
(1952) research on the experimenter as an 
experimental variable would indicate that the 
high on the rigidity meas- 
would more readily conform to this im- 
plied demand. 

Although the subject had to change some 
of the predictions he initially made on Test 1 


is the tendency to submit in an uncriti- 


subjec ts scoring 


ures, 


as a necessary precondition for improving his 
accuracy on Test 2, the failure to confirm the 
first hypothesis does not indicate that Hy- 
potheses 2 will be infirmed. If the shifts in 
judgment of the more rigid subjects were pri- 
marily induced by the compulsive need to 
conform to an implied pressure 
your judgment,” then the shift 


“to change 
would not 
necessarily be based on the new information 
available. The results this line of 
thinking. The accuracy of the less rigid sub 


SUP] ort 


jects tended to improve more on retest than 
the accuracy of subjects who 
on the F and R Scales. 


score d higher 
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Chown’s (1959) review of the rigidity lit- 
erature indicates that the concept of rigidity 
does not refer to a unidimensional trait. In 
the present research it appears that the di- 
mension(s) underlying a tendency to main- 
tain a initiated was 
less important than the dimension(s) under- 
lying a tendency to adhere to an authority’s 
demands. On the one occasion where the pre- 
dicted relationship between rigidity and accu- 
racy change reached acceptable levels of sta- 
tistical significance, the measure involved was 
the F attitudes and 
impulses more closely related to the interper- 


response pattern once 


Scale which focuses on 
sonal behavioral orientation of the subject 

In Study 3, 
tions were explored further 


these interpersonal considera- 
Reasoning that 
liked the instructor 
would be involved in a desire to please him, 
they, like the rigid subjects, were expected to 
be sensitive to an 


the students who most 


implied request for new 
] 


predictions. Because of the increased prob 
ability that their decision to change predic 
tions would be in response to the perceived 
request rather than the new autobiographical 
information, a negative correlation between 
the Study 3 subjects’ ratings of how much 
they liked the instructor and their increase in 
accuracy on Test 2 compared with Test 1 was 
predicted 

The results were equivocal. There was a 
tendency, not statistically significant, for the 
subjects who liked the instructor more to 
make fewer changes in judgment (r 20) 
However, the prediction of a negative rela- 
tionship between degree of liking the instruc- 
tor and change in prediction accuracy was 
confirmed, r 37, although with the small 
N of 21 the correl 
cant at the .05 level 


Having shown that both authoritarian sub 


ition not quite signifi 


mission tendencies and liking the person in 
a position of authority who defined the task 


tended to decrease the subjects’ attention to 


the information contained in the material 
presented, we expected these factors to work 
together to enhance the judgmental behavior 
being studied. Instead, there were small nega 
tive relationships between the subjects’ F and 
R Scale scores and the degree to which they 
liked the 


instructor (r 04 and 21 
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respectively), less rigid students liking the 
instructor more. 

Gregory’s (1955) discussion of the distinc- 
tion between “authoritarianism” and author- 
ity helps clarify how a person can be in a 
culturally defined position of authority, and 
yet not be liked by more rigid (authoritarian) 
subjects. Ratings on the IRF showed that the 
experimenter and his teaching methods were 
considered to be nonauthoritarian. Conceiv- 
ably, the more authoritarian subjects disliked 
the instructor because of this. When given an 
opportunity to respond, not to an order, but 
to a request, they would be motivated to con- 
form but at the same time be disinterested in 
pleasing the instructor. 

The results of the three studies formed a 
consistent pattern, but most of the correla- 
tions reported were small and not statistically 
significant. In addition to the factors dis- 
cussed above, the decreased level of ego-in- 
volvement which resulted from administering 
the clinical prediction task as an experimen- 
tal rather than “real” examination also at- 
tenuated the correlations. Enough Study 2 
subjects did not return a second set of pre- 
dictions to add to the set collected in class 
to suggest that a substantial decrease in mo- 
tivational relevance occurred when the sub- 
jects learned the test would not “count.” 


SUMMARY 


Subjects’ rigidity was related to their tend- 
ency to change judgments about a person. 
Judgments were made after reading half of 
an autobiography and repeated after complet- 


ZANWIL 


SPERBER 


ing the autobiography. Contrary to expecta- 
tion, a positive relationship between rigidity 
and shifts in judgment was observed. The 
relationship between degree of rigidity and 
change in accuracy of the judgments, how- 
ever, was negative. More rigid judges did not 
respond in terms of the additional informa- 
tion available, but rather appeared to be be- 
having in terms of a need to conform to the 
implicit demands of the authority figure that 
they change their judgments 
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THERAPIST VERBAL ACTIVITY LEVEL AND 
DIAGNOSTIC UTILITY OF PATIENT 
VERBAL RESPONSES’ 


EDMUND S. HOWE ano BENJAMIN POPE 


Psychiatric Institute, 

An article recently published in this journal 
(Howe & Pope, 1961) presented an empirical 
derivation and extensive reliability estimates 
of a parallel pair of scales designed to quan- 
tify the Activity Level of discrete therapist 
verbal responses in the initial interview. The 
concept of Activity was argued to involve cor- 
related attributes of Ambiguity, Lead, and In- 
ference, such that when the therapist’s Ac- 
tivity Level is low his verbal responses are 
ambiguous, nonleading, and carry low infer- 
ence; and when, conversely, the therapist’s 
Activity Level is high his verbal responses are 
relatively unambiguous, leading, and carry 
high inference. It is assumed by many writers 
(e.g., Deutsch & Murphy, 1955; Finesinger, 
1948) that maintenance of a low verbal Ac- 
tivity Level (in the sense defined) enables 
the therapist not only to foster the develop- 
ment of transference reactions and to avoid 
a shift into a social relationship with the pa- 
tient, but also to elicit more relevant types of 
information from him. Indeed, several writers 
have already reported certain empirical con- 
sequences of variations in therapist Activity 
Level. Dibner (1953), for example, observed 
increased patient “anxiety” following greater 
therapist Ambiguity; while Bernstein, Len- 
nard, and Palmore (1958) observed greater 
“ease of communication” by the patient fol- 


1This paper arises from a program of research 
supported by Pilot Evaluation Grant No. 2M-6408 
C2-S1 from the National Institute of Mental Health 
of the National Institutes of Health, United States 
Public Health Service. The late Jacob E. Finesinger 
was the Principal Investigator. Thanks are acknowl- 
edged for his continuous encouragement and whole- 
hearted support of this work until his untimely death 
in June 1959. Actual performance of this experiment 
and completion of the manuscript have been facili- 
tated through tenure by.E.S.H. of Research Grant 
M-3355, also from the National Institute of Mental 
Health. Thanks are acknowledged to Allen T. Ditt- 
mann for reading and evaluating an earlier draft of 
the paper. 


University of Maryland School of Medicine 


lowing greater therapist Specificity (i.e., less 
Ambiguity). 

The purpose of the study now reported was 
to examine a further potential correlate of 
therapist Activity Level. One would expect 
that within a given interview the Activity 
Level of the therapist’s verbal responses would 
bear a demonstrable relation to the diag- 
nostic utility of the information thus elicited 
from the patient. Specifically, discrete pa- 
tient following low- 
active rather than high-active therapist re- 
sponses, 


responses immediately 


should be judged to carry greater 
diagnostic utility. The present experiment ex- 
amines this hypothesis. 

No attempt is made in this study to clarify 
the difficult question of a conceptual defini- 
tion of “diagnostic utility.” Rather, it 
decided at this stage to allow the raters to 
operate globally, as it 
inimitable 
them. 


was 


whatever 
naturally to 


were, in 


fashion came most 


Tue STUDIES 
Method 


In Study 1 ratings were obtained of the Diagnostic 
Utility of each discrete patient response in four ini- 
tial interviews. In Study 2 these ratings were sys- 
tematically compared with the Activity Level (ob- 
tained in Howe and Pope) 


response 


of the preceding therapist 


Procedure of Study 1 


Eight raters consisting of four experienced clinical 
psychologists with the PhD degree and four psy 
chiatrists having between 2 and 6 years of experience 
in or beyond residency were used in a modified latin 
square design earlier adapted (Howe & Pope, 1961) 
Cutler, Bordin, Williams, and Rigler (1958) 
For each of four interviews the subject rated discrete 
patient responses for Diagnostic Utility, along a four- 
point scale to be described below. For two of the 
four interviews presented to each subject the im- 
mediately preceding therapist response was present 
(“Context Present” condition) while for the other 
two interviews the preceding therapist response was 


irom 
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absent (“Context Absent” condition). This variable 
was an essential ingredient in the design since, aside 
from the desirability of controlling the influence of 
therapist responses on the subject’s ratings, a few 
patient responses (e.g., “Yes’’; or, “Two years ago’) 
would obviously have no reliably useful meaning 
whatsoever for the subject, if rated out of context 
Further, in order at least partially to control for the 
confounding of temporal order of occurrence within 
the interview and the manifest content of the pa 
tient’s verbal behavior, discrete patient 
from two of the interviews were presented to the 
subject (“Serial Arrangement” condition), 
while such responses from the other two interviews 
were roughly randomized (by shuffling) before pres 
entation to the subject (“Random 
condition). The “Sequence” of 
each subject, the “Order” in 

view was rated, the variable, and the 
“Arrangement” variable were systematically varied 
and controlled in the design shown in Table 1. It 
will be noted that four pairs of subjects, one psy 
chiatrist and one clinical psychologist, were treated 
identically within pairs. 

The four interviews are all initial ones. Interview I 
is taken from Wolberg (1954, pp. 690-699) 
ing of the first 73 patient responses; * Interview II 
is by Gill (Gill, Newman, & Redlich, 1954, pp. 134 
P Interview III, which 
was performed by an extremely close adherent to the 
Finesinger (1948) technique (Finesinger & Powder- 
maker *) and is consequently referred to here as 
“Finesingerian,” contains 61 responses; while Inter- 
view IV, with 48 patient responses, was performed 
by Rogers (1947, pp. 128-142). 

The “scale” of Diagnostic Utility was defined only 
by the points Zero, Low, Moderate, and High. The 
subject was instructed to sit in clinical diagnostic 
judgment as though he were in an initial interview, 
and to behave as idiosyncratically as he might wish 
He was asked to decide for himself whether to rate 
each patient response in terms of manifest sympto- 
matology, etiological 


responses 


seriatim 


Arrangement” 


rated by 


given 


interviews 
which a 
“Context” 


inter 


consist 


44) and contains 73 responses; 


evaluation of 
the patient’s defenses, or some combination of these 
(or other) criteria. (A trial run with annotated cate- 
gories clearly more definitive than the four men- 
tioned above had yielded a strong impression that 
the subject would generally be much more accept- 
ing of his assignment were he left to project his own 
interpretation of the scale and of the category labels, 
rather than try to comprehend a literal, and inevi- 
tably arbitrary, set of criteria presented by the ex- 
perimenter. The spontaneous reactions of all eight 
subjects to the instructions in no way discredited 
this impression.) The total time taken for the four 


considerations, 


2It was necessary to ignore two responses (e.g., 
a greeting) from each interview. Values of N are net 

}Finesinger, J. E., & Powdermaker, F. Part V: A 
clinical picture of claustrophobia. In, Seven films on 
interviewing and psychotherapy. Veterans Adminis- 
tration, Department of Medicine and Surgery, Wash- 
ington, D. C. 
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rABLE 1 


EXPERIMENTAL D1 


ent responses 
Random 


Context 


i text Context 
Raters nt Absent Present 


Absent 


PhD 


tasks varied from about 55 minutes to 24 hours pe! 
subject. 


Results of Study 1 * 


The analysis of variance is shown in Table 2 
It is observed that the four interviews are sig- 
nificantly discriminated from each other (p 
< .01), and this is a major requirement which 
it was hoped to satisfy. Neither the Context 
variable nor the Arrangement variable pro- 
duces a significant effect. While this is some- 
what surprising it should be noted that there 
is no logically compelling reason why these 
two effects should be significant in the pres- 
ent case. The overall low reliability of the 
ratings themselves, however, 
scured the effects of these 
degree. 

The between-rater reliabilities for each in- 
terview were computed by IBM, making avail- 
able a total of 4(8 x 7)/2 
moment correlation 


could have ob- 
variables to some 


112 product- 
These data 
are comparatively summarized in Table 3 for 


coefficients. 


psychologists (PhD group) versus psychia- 
trists (MD group). The slight differences be- 
tween the two types of subjects are not sig- 


nificant,® and in any case not systematic. For 


‘Many thanks are acknowledged to E.S.H.’s for- 
mer research assistant, Michael S. Black, now at the 
University of Illinois, for performing all of the office 
computations involved in both Studies 1 and 2 

5 Unless otherwise stated all 
ported in Study 1 were mad 
(e.g., Siegel, 1956, 111-116) 


comparisons of r’s re- 
with the median test 





THERAPIST 


rAB 


squares 


Sequences 4.00 


Raters within sequences 2.10 
Total between raters 


Order 


6.10 


Interviews 
Experimental cor 
Context 

Arrangement 

Context X Arrangement 
Pooled error 


i! within raters 


Grand Total 


* » 


01 


> < 001 


all three groups tabulated there is, however, 
consistently superior median agreement for 
Interviews I, II, and III, as compared with 
the median value for Interview IV (Rogers) ; 
but the differences are significant only within 
the All Subjects group (p < .02). 

The reliability data were also grouped ac- 
cording to various combinations of experi- 
mental conditions (e.g., “same Context, same 
Arrangement,” “same Context, different Ar- 
rangement,” etc.) but since none of the dif- 
ferences between these conditions is signifi- 
cant, tabular data are not reproduced here. 
Smaller median reliabilities for Interview IV 
in comparison with the other three inter- 
views occur, however, under all experimental 
conditions, but these differences 
significant. 


also are not 


ACTIVITY AND 


Dr1AGNostic UTILIty 


) 


19.14°** 


Finally, examination was made of the reli- 
abilities within the four pairs of raters whose 
experimental treatments were identical for 
each interview. While several of the differ- 
ences are statistically significant they are in 
no sense of a systematic nature, and no broad 
interpretive statement would be justified. 
Tabular presentation of these data also is 


therefore omitted. Once again, however, 

® Data referred to in this and the preceding para 
graph, together with a Table of Means and SDs of 
Diagnostic Utility Ratings for each interview, have 
been deposited with the American Documentation In- 
stitute. Order Document No. 7 from ADI Aux- 
iliary Publications Project, Photoduplication Service, 
Library of Congress; Washington 25, D. C., remit- 
ting in advance for microfilm or $1.25 for 
photocopies. Make checks payable to: Chief, Photo- 
duplication Service, Library of Congress 
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IV 
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Values of 
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III; and .37 for IV 
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b Each 
for the “All Subjects” g 


edian value i 
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and III are 
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across all conditions of experimental treat- 
ment Interviews I, II, and III show arith- 
metically superior reliabilities in 10 out of 
12 comparisons with Interview IV, of which 
six are significant (p < .05). 


Comments on Study 1 


The medians of the 28 interrater r’s for 
each interview are .38 for both I and II, .39 
for III, and .18 for IV. While the overall 
level of reliability is thus rather low, this is 
perhaps not too surprising in view of the type 
of judgment the raters were called upon to 
make. The implied difficulty experienced by 
the raters in judging Diagnostic Utility of 
the patient responses in Rogers’ interview 
(IV) complements the relatively low agree- 
ment shown by other raters (Howe & Pope, 
1961) in applying the Activity Scale to the 
therapist responses in the same interview. But 
the design. employed does at least have the 
advantage of having explored a number of 
variables which potentially were likely to in- 
fluence ratings of Diagnostic Utility; and one 
may accordingly feel somewhat more com- 
fortable about the uses to which these rat- 
ings are now put. 


Procedure of Study 2 


This was undertaken, it will be recalled, to facili- 
tate tests of the hypothesis that there is an inverse 
relation between the Activity Levels of discrete thera- 
pist responses, and the Diagnostic Utility Rating of 
the immediately following patient response. 

Eight raters had earlier performed the ratings of 
Activity Level of discrete therapist responses; and 
in Study 1 eight different subjects rated the immedi- 
ately following patient response. For each of the 
four interviews a block of 8 X 8= 64 Pearson ?r’s 
could thus be computed making in all a total of 
64 X 4 = 256 independent indices of relation between 
therapist Activity Level and Diagnostic Utility of the 
following patient response. If the hypothesis is to be 
borne out then the correlations should be negative 
in sign. 


Results of Study 2 


Table 4 shows the overall trend based upon 
64 r’s per interview. There is a considerable 
range of variation within each interview. In- 
terviews I, II, and IV clearly yield a majority 
of negative values (the respective proportions 
being .69, .83, and .84) even though none of 
the four median values approaches signifi- 


rABLE 4 


OVERALL RELATION (PEARSON r’s) 
LeveL (AL) oF THERAPIST 
nostic Utitity RAatinc (DUR) oF THE FOLLOW 

ING PATIENT RESPONSE; BASED UPON DATA 
FROM INDIVIDUAL PArrs OF SUBJECTS 


BETWEEN ACTIVITY 
RESPONSE AND DrAG- 


Number 
of nega 
Interview tive 


number Interviewer values 


folberg 44/64 


53/64 
22/64 


Rogers 54/64 


* Each median is based upon 64 7 
>’ For values of N, and for r 
significance, see Note Table 3. 


al values of 7 achieving 
cance. While one might appear to be theo- 
retically justified in adopting some suitable 
method to test the significance of the fre- 
quencies of negative values, such a procedure 
would be of dubious propriety since, as one 
has seen, the ratings of the two groups of 
eight subjects concerned show low to moder- 
ate within-group intercorrelation. 

A number of other measures were therefore 
made of the hypothesized relationship. 

For each interview a mean Activity Level 
(AL) score per therapist response was ob- 
tained by pooling the ratings of the eight sub- 
jects used in the earlier study. From the data 
of Study 1 a mean Diagnostic Utility Rating 
(DUR) was obtained for each patient re- 
sponse, based also upon the pooled ratings of 
all eight subjects. The value of r between 
these two sets of data was then obtained for 
each interview. The results are presented in 
Table 5. Correlations are both statistically 
significant and negative for Interviews II (p 
< .05) and IV (p< .05). The value for In- 
terview I is negative in sign but fails to reach 
even a 10% criterion, while that for Inter- 
view III is closer to zero and of positive sign. 
An argument could, of course, be made for 
application of the one-tailed test. But it 
should be borne in mind that while one may 
reasonably anticipate a negative relation be- 
tween the two variables at hand, existing em- 
pirical knowledge about the relation between 
Activity Level and Diagnostic Utility is pres- 
ently quite meager, and one is in no position 
to be dogmatic. 

A plot for each interview of the pooled AL 
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data against pooled DUR data showed a 
slight visual impression of curvilinearity for 
both Interviews II and IV. In each case a 
line sketched by hand suggested mean DUR 
to be an inverse, negatively accelerated func- 
tion of mean therapist AL. Departure from 
linearity was not confirmed in either case." 
Finally, a check was made upon the ques- 
tion whether the observed correlations be- 
tween AL and DUR for Interviews II and 
IV were somehow a function of the length, 
as distinct from the content of responses 
elicited from the patient. It is indeed pos- 
sible (a) that therapist verbal responses of 
low (as against high) AL elicited, in these 
two interviews, longer verbal responses from 
the patient; and (4) that DURs tended to 
be higher for longer patient verbalizations re- 
gardless of content. Were this the case then 
the observed values of r between AL and 
DUR would potentially be artifactual. 
Accordingly, for Interviews II and IV r’s 
were computed between (a) mean DUR and 
the frequency of clauses per patient re- 
sponse; * and between (4) mean AL of each 


7 A comparison of median correlations between the 
AL and DUR variables was made by breaking down 
the 64 r’s per interview into several subgroups (eg., 
PhDs vs. MDs; Serial vs. Random Arrangement; 
Scale A vs. Scale B, see Howe & Pope, 1961) in 
case significant relations lay hidden, cancelled out by 
opposing variables. Since none of these subgroup- 
comparisons yielded statistical significance they are 
not reproduced in the text 

8 The frequencies of clauses were counted by Aron 
W. Siegman, and B.P 


rABLI 


RELATION BETWEEN ACTIVITY 
Utinity RatTiInG FoR EAcu INTERVIEW 
UPON PooLeD DATA 


LEVEL AND 


Interview 


Wolbe rg 
Gill 
Finesinger 


Rogers 


Note.—"Pooled data” implies t 
response was derived, for each inter\ 
eight subjects used in Howe 
a mean DUR per patient re 

atings of all eight subjex 
per. 
* For values of N see Note 
*p <0.5 (two-tailed test). 


TABLE 6 


SUMMAER F THERAL 


RATINGS OF 


CORRELATION 

Activity Levert, DiaGnostic Utrity 

PATIENT RESPONSES, AND NUMBER OF CLAUSES 
OCCURRING PATIENT RESPONSES 


BETWEEN 


df = 
* Read: r between AL a 
clauses partia led out 
*p < .05. 
>) < .01. 
=>) < .001. 


therapist response and the number of clauses 
in the following patient response. The data 
are presented in Table 6. It is observed that 
for both interviews: (a) therapist AL bears a 
positive, but borderline relation to the Clause 
variable (p = .10); (6) DUR is negatively 
and significantly correlated with the number 
of patient clauses (p < .01). When the Clause 
variable is partialled out the net correlation 
between AL and DUR drops below a signifi- 
cant value for both interviews. 


DISCUSSION 


One should not be hasty in drawing from 
Study 1 a conclusion that the non-Rogerian 
interviews are generally more reliably rated 
for Diagnostic Utility than is the Rogerian 
one. While reliability indices for the non- 
Rogerian interviews are rather consistent de- 
spite the consideration that different inter- 
viewers and different patients are involved, 
both Client and Counselor variables are obvi- 
ously confounded with “Rogerianism” in In- 
terview IV. Further work designed to yield 
an indication of “within-Rogerian variance” 
would therefore be required before any justi- 
fiable conclusion could be drawn about this 
matter. 

With regard to the rather negative results 
of Study 2 it should first be noted that while 
reliability of the measures of therapist Ac- 
tivity Level can hardly be in dispute (the 
grand median was about .51, Howe & Pope, 
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1961), overall reliability of the Diagnostic 
Utility Ratings of patient responses is clearly 
lower (the grand median is .38). But reli 
ability in any case does not necessarily imply 
validity. The concept of Diagnostic Utility 
employed here is broad and undefined; and 
one may therefore seriously question whether 
ratings of printed transcriptions of patient 
responses in the manner described are more 
than superficially valid. 

One workable (and in retrospect perhaps 
fundamentally more desirable) alternative to 
a global measure of Diagnostic Utility would 
involve prior multivariate analysis of the di- 
mensionality of Diagnostic Utility. Such a 
procedure would hopefully enable the experi- 
menter to instruct the subjects (in the type 
of rating situation employed above) to rate 
patient responses in terms of specific, corre- 
lated diagnostic attributes, and to ignore 
certain other (independent) attributes which 
would otherwise contribute error variance and 
hence lower estimates of reliability. This 
proposal implies that if the basic hypothesis 
(of an inverse correlation between therapist 
Activity Level and Diagnostic Utility of the 
following patient response) is to be taken 
seriously, then in view of the unconvincing 
empirical outcomes of this study Diagnostic 
Utility should be assumed a multidimensional 
affair demanding preliminary conceptual clari- 
fication before the phase of hypothesis testing. 

An additional, serious difficulty concerns 
separation of the three partially confounded 
variables of therapist Activity Level, amount 
of patient output per response (the Clause 
variable) and rated Diagnostic Utility of the 
patient response. This problem will of course 
inevitably arise if one insists on “experimen- 
tal” study of naturalistic interview behavior. A 
partial solution to the impasse may lie in the 
experimenter’s deliberately controlling, as an 
independent variable, the therapist Activity 
Level within interviews: as, for example, by 
instructing the interviewer to respond within 
two or three different, operationally-defined 
ranges of Activity Level, during specified 
phases of the interview. This would facilitate 
close inspection, across a broad range of 
Activity Levels, of the relation between the 
Clause variable, and ratings of Diagnostic 
Utility. A significant chance relationship is 
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naturally to be expected between amount of 
patient productivity within a response, and 
Diagnostic Utility of the response as a whole. 
But it is not at all certain on the basis of the 
data presented here whether the subject arbi- 
trarily and nonconsciously identifies Produc- 
tivity with Diagnostic Utility, or whether 
these two variables are in fact intrinsically 
related in the four interviews studied. This 
ambiguity seems eminently suited for rigorous 
experimental clarification. 


SUMMARY 


This paper reports one test of the hypothe- 
sis that in the initial interview therapist ver- 
bal responses of low Activity Level (i.e., re- 
sponses which are ambiguous, nonleading, 
and which carry low inference) are more 
likely immediately to lead to diagnostically 
useful material from the patient. 

In Study 1 a latin square design required 
eight untrained, professional subjects to rate 
each patient response from each of four ini- 
tial interviews along a scale of Diagnostic 
Utility. The interviews were by Wolberg, by 


Gill, by a Finesingerian, and by Rogers. 


Therapist Context Present versus Therapist 


Context Absent constituted one factorial vari- 
able, and Serial versus Random Order of 
presentation of the responses to the subject 
constituted the other. The analysis of vari- 
ance showed a significant difference 
interview The interview by 
was significantly and consistently rated less 
reliably than were the 
interviews. 

In Study 2 the Diagnostic Utility Ratings 
(DURs) made by the eight individual sub- 
jects of discrete patient responses in each in- 


among 
means. Rogers 


three non-Rogerian 


terview, were systematically compared with 
ratings (independently obtained from eight 
other individual subjects) of .the Activity 
Level (AL) of each preceding therapist re- 
sponse. Were the hypothesis to be borne out 
then the r’s thus obtained should be nega- 
tive in sign. For three of the interviews there 
were 69, 83, and 84% negative r’s: although 
in no case was the median value (of 64 7’s) 
statistically significant. The median for the 
other interview was close to zero and of posi- 
tive sign. When pooled data were used to esti- 
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mate between 


the relation mean AL and 
mean DUR the hypothesis was borne out for 
the interview by Gill and for that by Rogers. 

It was subsequently observed, however, that 
for both of these interviews: (a) mean DUR 
per patient response was significantly corre- 
lated (p < .01) with the number of Clauses 
in that response; and (4) mean AL of the 
previous therapist response was evidently not 
totally unrelated to the Clauses measure (p 
= .10). When the correlation due to Clauses 
was partialled out the net 7’s between AL and 
DUR for these two interviews dropped be- 
low significance. 


The findings and some problems concerning 


measurement of the hypothesized relation are 
discussed. 
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A FACTOR ANALYSIS OF 


ROBERT C. NICHOLS ! 


Purdue University 


In the interpretation of human figure draw- 
ings clinicians have attributed different per- 
sonality significance to a large number of 
drawing characteristics and several scales have 
been developed for quantifying global evalua- 
tions of drawings along various dimensions. 
Research on the validity of the personality 
interpretation of various aspects of drawings 
as reviewed by Swenson (1957) and a study 
of a number of global scales by Striimpfer 
(1959) indicates that the usual interpreta- 
tions of figure drawings are of doubtful va- 
lidity. Whitmyre (1953), among others, has 
shown that when psychologists think they are 
judging adjustment from drawings they are 
really judging artistic quality. Thus, there is 
considerable evidence that clinicians cannot 
identify and correctly interpret the salient 
aspects of drawings. Nevertheless, a recent 
survey by Sundberg (1961) indicates that 
figure drawing, used as a projective test, is 
second only to the Rorschach in frequency 
of use in hospitals and clinics in the United 
States. Thus further study of this technique 
is badly needed. 

An alternative approach to the usual use of 
clinical intuition as a source of hypotheses 
about drawings is to determine empirically 
what dimensions are present in a number of 
drawing scores and to search systematically 
for personality correlates of these dimensions. 
It is the purpose of the present study to fac- 
tor analyze a variety of drawing scores in or- 
der to find what aspects of drawings might 
legitimately be considered independently and 
what aspects should be combined into a single 
score. 

METHOD 
Subjects 

Human figure drawings were available from two 

groups of subjects: a group of 107 male college stu- 


dents and a group of 90 male VA patients. The tests 
were presented to the college students as part of a 


1Now at National Merit Scholarship Corpora- 
tion; Evanston, Illinois. 


DRAW-A-PERSON TEST SCORES 


DEODANDUS J. W. STRUMPFER 


Potchefstroom University, Union of South Africa 


research project during a regular class meeting of in- 
troductory psychology and were group administered. 

The 90 VA patients were carefully selected to in- 
clude 30 “normal” patients hospitalized in a general 
hospital for nonpsychiatric disorders and with no 
psychiatric history, 30 patients with predominantly 
neurotic symptoms from the psychiatric ward of 
a general hospital and a VA outpatient clinic, and 
30 patients with predominantly schizophrenic symp- 
toms from the psychiatric ward of a general hos- 
pital. These groups were matched for mean age and 
education. All of the VA patients were individually 
tested with the test being presented as a regular part 
of the diagnostic procedure. Tests for the two psy- 
chiatric groups were taken from the files. Further 
details of the samples and testing procedures are 
given by Striimpfer (1959). 


The Drawing Variables 


In a factor analytic study it is necessary to sam- 
ple adequately the domain to be factored se that all 
factors affecting measures in the domain will be rep- 
resented. In the human figure drawing domain there 
are an infinite number of drawing characteristics that 
could be considered. There are, however, a limited 
number of characteristics that have been noted by 
clinicians to reflect the personality of the subject 
making the drawing, and it is with these aspects that 
the present study is concerned. In the clinical inter- 
pretation of figure drawings two main approaches 
have been used. (a) Global judgments are made of 
the degree to which the drawing taken as a whole 
reflects some characteristic such as adjustment, ag- 
gression, artistic quality, etc. (b) Specific details of 
the drawing are considered separately, and often 
several details felt to reflect a single personality trait 
are summed in a point scale to obtain a score for 
this trait. 

A number of scales have been published for quanti- 
fying global judgments of drawings by comparing 
the drawing to be scored with a standard set of 
drawings selected to represent the points on the scale. 
All of the scales of this sort that could be found in 
the literature were included in this study. These were 
scales for (a) “Adjustment” developed by Albee and 
Hamlin (1949), (6) “Sexual Differentiation” devel- 
oped by Swenson (1955), (c) “Maturity” developed 
by Dunn and Lorge (1954), (d) “Aggression” de- 
veloped by Striimpfer (1959), and (e) “Artistic 
Quality” developed by Wagner and Schubert (1955). 

In addition to the global judgment scales the point 
scales available in the literature were also included. 
These were the “Body Image” scale discussed by 
Fisher (1959), and the “Weighted Flaw” score, the 
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“Weighted Good” score, the “Net Weighted” score, 
and the “Percent Raw G” score developed by Buck 
(1948). 

A discussion of the method of scale construction 
and scoring for each of these scales can be found in 
Striimpfer (1959). For purposes of this study the 
scoring direction of the Adjustment, Artistic Quality, 
Body Image, and Weighted Flaw scales was reversed 
so that high scores are indicative of good adjust- 
ment, good artistic quality, lack of body image dis- 
tortion, and high intelligence, respectively. The other 
scales which were not reversed are scored so that 
high scores indicate a large amount of the quality 
indicated by the title. With scales scored in this way 
the better executed drawings obtain higher scores on 
most scales. 

In addition to the scores for the scales listed above, 
the height of each figure was measured in millimeters, 
and these height measures were combined in 
ways: 


two 
(a) the sum of the heights of the male and 
female figures and (b) the ratio of the male and fe- 
male figures. Also the 14 details that enter into the 
Body Image scale were scored separately as 2 sam- 
ple of the specific drawing details that are commonly 
interpreted clinically. The 14 details, which are de- 
scribed in greater detail by Fisher (1959), are: Era- 
sures, Transparency, Lack of any Body Part, Nose 
indicated by Two Dots, Mouth indicated by Single 
Line, Hands behind Back, Crude Clothing, Lack of 
Breasts in the Female Figure, Shading of the Body, 
Lack of Delimiting Lines, Figure off Balance, Figure 
very Small, Shading of Crotch Area, and Opposite 
Sex Drawn First. Each of these details was scored as 
present if it occurred in either the male or the female 
figure and as absent if it occurred in neither figure 

The interrater reliabilities for scoring drawings on 
the various scales were determined by Striimpfer 
(1959). These reliabilities, corrected by the Spear- 
man-Brown formula to the reliability of the average 
of two raters as used in this study were generally 
satisfactory and ranged from .59 for the Aggression 
scale to .95 for the Artistic Quality scale. The reli- 
abilities of the drawing details were estimated by 
scoring the male and female drawings separately for 
each detail and calculating phi coefficients as an in- 
dex of reliability over the two drawings. These co- 
efficients, which were not corrected by the Spearman 
Brown formula since the score used in the analysis 
was not a sum, are shown in Table 1. 


RESULTS 
Statistical Analysis 


The scores for the scales and drawing de- 
tails described above plus the age of the sub- 
ject and a dichotomous score indicating 
whether he was a member of the college or 
VA sample amounted to 32 scores for each of 
the 107 college and 90 VA subjects. The in- 
tercorrelations of these 32 scores, based on all 
197 subjects were calculated using product- 


DAP 


rABLE 1 


RELIABILITIES OF THE DRAWING 
COMBINED COLLEGE 


N 


DETAILS FOR THI 


AND VA SAMPLES 
197) 


Drawing detail Reliability* 
Erasures 
rransparency 

Lack of Body Part 
Nose as Two Dots 
Mouth as Single Line 
Hands behind Back 
Crude Clothing 
Shading 

Lack of Lines 

Figure off Balance 
Figure very Small 


* Reliability is a phi coefficient showing cons 
letail in the male and the female drawing 


moment coefficients for the correlation of two 
continuous variables, point-biserials for a con- 
tinuous and a dichotomous variable and phi 
coefficients for two dichotomous variables.” 
The resulting matrix was factored by the 
principal components method with commu- 
nalities estimated as the highest correlation 
in each row. On the basis of the size of the 
highest loadings and the size of the latent 
roots it was decided to retain five factors, 
which were rotated analytically to the Vari- 
max criterion of orthogonal simple structure.* 
After rotation the fifth factor had no ap- 
preciable loadings and was dropped from fur- 
ther consideration. The rotated loadings are 
presented in Table 2. 

After the above analysis was done, it was 
decided to make a further study of the fac- 
tor structure of some of the variables in the 
sample of VA patients alone. This was con- 
sidered desirable for three reasons: (a) An 
external criterion of adjustment in the form 


2 The correlation matrices from the combined sam- 
ple and the VA sample and the orthogonal factor 
matrix from the VA sample have been deposited with 
the American Documentation Institute. Order Docu- 
ment No. 7047 from ADI Auxiliary Publications 
Project, Photoduplication Service, Library of Con- 
gress; Washington 25, D. C., remitting in advance 
$1.25 for microfilm or $1.25 for photocopies. Make 
checks payable to: Chief, Photoduplication Service, 
Library of Congress. 

3 These calculations were performed on 


Purdue 
University’s Datatron 205 computer. 
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TABLE 2 


ROTATED ORTHOGONAL FACTORS FROM 
AND COLLEGE SAMPLES COMBINED 


Score 


Adjustment scale 

Sex Differentiation 
Maturity (male figure) 
Maturity (femlae figure) 
Total Maturity 
Aggression (male figure) 
Aggression (female figure) 
Total Aggression 

Body Image 

Artistic Quality 

Weighted Flaw 

Weighted Good 

Net Weighted 

% Raw G 

Height M/F 

Height M + F 

Erasures 

lransparancy 

Lack of Body Part 

Nose as Two Dots 
Mouth as a Single Line 
Hands behind Back 
Crude Clothing 

Lack of Breasts on Female 
Shading of Body 

Lack of Delineating Lines 
Figure off Balance 

Figure very Small 
Shading of Crotch Area 
Opposite Sex Drawn First 


Age 
Membership in VA sample 


of the division of the sample into normal, 
neurotic, and psychotic groups was available 
for inclusion in the matrix, (6) The VA draw- 
ings were collected in the usual individual 
testing situation and are thus more typical of 


MPFER 
drawings used clinically than are the college 
drawings which were collected in a group 
(c) If the patterns of correlations in the col 
lege and VA groups differ appreciably, fac 
tors present in only one of the samples may 
be obscured. 

Fifteen of the variables were selected for 
further study from the 32 included in the 
first analysis by dropping part scores and 
variables with low communalities. A gross be- 
havioral adjustment score was added by as- 
signing the 30 psychotic patients a score of 
1, the neurotic patients a score of 2, and the 
normals a score of 3. These variables were 
intercorrelated using the 90 VA patients as 
subjects. The resulting matrix was factored 
by the principal components method with 
communalities estimated as the squared mul- 
tiple correlation of each variable with all 
other variables. On the basis of the size of 
the highest loadings and of the latent roots 
it was decided to retain four factors which 
were rotated analytically to the Varimax cri- 
terion of orthogonal simple structure. 

From an inspection of the orthogonal fac- 
tors it was felt that simple structure could be 
improved by oblique rotation of two of the 
factors. The factors were plotted graphically 
two at a time and rotated obliquely until in- 
spection of the plots indicated that further 


TABLE 3 


OBLIQUE Factor LoapINGs FRoM VA SaAMPL 


Score 


Gross behavioral adjustment 
Age 

Adjustment score 

Sex Differentiation 

Total Maturity 

Body Image 

Artistic Quality 

Net Weighted Score 
Height M + I 

Total Aggression 

Lack of Body Part 

Hands behind Back 

Crude Clothing 

Lack of Breasts on Female 
Lack of Delineating Lines 
Figure off Balance 


® Communalities obtained from orthogonal factor matrix. 


Factors 
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TABLE 4 


C-MaTrix, CORRELATIONS BETWEEN OBLIQUE 
FACTORS FROM THE VA SAMPLE 


Factor 


rotation would not improve the simple struc- 
ture. The loadings of the retained variables 
on the oblique factors are shown in Table 3, 
and the correlations between the 
vectors are shown in Table 4 


reference 


The Factors 


The rotated factor matrix of orthogonal 
factors from the first analysis done on the 
total group of subjects is shown in Table 2. 
These factors are labeled with Roman nu- 
merals to distinguish them from the oblique 
factors from the VA sample, which are desig- 
nated by letters. 

Factor I is a very 
counts for most of the 
among the drawing scores 


large that ac- 
common variance 
All of the scales 
with the exception of the Aggression scale 
have high loadings on this factor. The draw- 
ing details that load on the factor, all with 
negative loadings, seen to indicate poor or in- 
complete drawings. These are: Lack of Body 
Part, Crude Clothing, Lack of Breasts on Fe- 
Figure, Lack of Delimiting and 
Figure off Balance. When asks himself 
what the variables which load on this factor 
have in common that is not present in those 
variables which do not load on the factor, one 
is faced with two possibilities: (a) The factor 
may represent some broad dimension of psy- 
chological adjustment, since most of the high 
loading variables have been felt to be related 
to some aspect of adjustment. (b) The factor 
may represent the dimension of drawing abil- 
itv. Certainly the high scoring drawings on 
all of the scales loading on the factor appear 
to be better drawings, in the sense of look- 
ing more like a person, than the low scoring 


factor 


male Lines, 


one 


drawings, and the Artistic Quality scale has 
a loading of .90. 
A possible compromise interpretation is that 
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drawing ability and adjustment are themselves 
highly correlated and are both reflected in the 
factor. The Artistic Quality scale and the Ad- 
justment scale were correlated .86 with each 
other and both load highly on the factor. The 
analysis of the VA data to be discussed below 
may aid in deciding between these possible 
interpretations. 

Factor II is identified by the high loadings 
of age and membership in the VA sample 
This factor is due to the fact that two groups 
differing greatly in age were included in the 
sample. Had the score indicating group mem- 
bership not been included in the matrix, this 
factor would not have appeared, and age 
would have had a small communality. The 
group membership score was included because 
it was expected that many aspects of the 
drawings would vary from college to VA sam- 
ples. However, this is clearly not the case 
Che only way in which VA and college draw 
ings differ is in overall quality or adjustment 
of the drawing as is shown by the loading of 
group membership on Factor I, and the low 
loadings of the drawing scores on Factor II. 

Factor III is clearly a size factor, with 
Height and Figure very Small being the only 
variables with appreciable loadings. The small 
loading of the Body Image scale is probably 
due to the fact that size of figure is part of 
the Body Image score. The size of the draw- 
ing seems to have a low positive relationship 
with overall quality of drawing as is shown by 
the loading of the two size variables on Fac- 
tor I. Other than this, however, size has little 
relationship with other aspects of the draw- 
ings, as is shown by the low loadings of the 
drawing scores on Factor III. 

Factor IV is defined by the three Aggres- 
sion scores and has no other high loadings. 

The oblique factors from the VA sample 
are shown in Table 3. 

Factor A is clearly the same large “Quality- 
of Drawing” factor found in the total group. 


It is now possible to answer the question 
raised in the total sample of whether or not 


this factor could represent some broad di- 
mension of psychological adjustment. If this 
were the case, gross behavioral adjustment as 
represented by the division of the sample into 
normal, neurotic, and psychotic groups would 
be expected to load highly on the factor. In- 
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stead the loading of gross behavioral adjust- 
ment is —.02. The fact that this negative 
loading is not due to some artifact of the 
rotation is attested to by the correlation of 
—.03 between gross behavioral adjustment 
and the figure drawing Adjustment score. 
Thus the most plausible interpretation of the 
factor is one of overall quality of the draw- 
ing, perhaps due to individual differences in 
drawing ability which are independent of psy- 
chological adjustment. 

Factor B evidently represents a tendency 
to draw big bosomy figures and is negatively 
correlated with Factor A. 

Factor C is characterized by loading of 
Missing Parts and Hands behind Back and 
the negative loading of Figure off Balance. 
This might very tentatively be interpreted as 
defensiveness and constriction in drawing. It 
is negatively correlated with Factor A. 

Factor D is the only factor on which gross 
Adjustment is loaded, and it is thus of great 
interest to see what drawing variables are also 
loaded on this factor. No drawing variable 
has its major loading on this factor, but the 
positive loading of Sexual Differentiation and 
the negative loading of Absence of Breasts in 
the Female Figure may be considered mini- 
mally significant. 


DISCUSSION 


For the heterogeneous group of subjects 
represented by the combined college and VA 
subjects a single factor, interpreted as overall 
quality of drawing, accounts for most of the 
common variance among a wide variety of 
drawing scores. The additional factors found 
can be interpreted as artifacts of the method 
of scoring the variables. Although other inter- 
pretable factors were found in the VA sample, 
again most of the common variance was ac- 
counted for by a single factor. 

In their early thinking about these results 
the writers began referring to this large first 
factor as artistic quality, mainly because of 
the high loadings of the Artistic Quality scale. 
In a preliminary attempt to construct a scale 
of sample drawings to illustrate this factor 
they began sorting drawings from a new sam- 
ple according to artistic quality as defined by 
their experience with the scales loading highly 
on the first factor. They obtained very good 
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interrater reliability among psychologists in 
this endeavor, but when an artist from a uni- 
versity art department was asked to rate the 
drawings on artistic quality they found a near 
zero correlation between the ratings of psy- 
chologists and the ratings of the artist. The 
explanation for the low correlation was found 
in differing conceptions of artistic quality. 
The psychologists, as do the scales loading 
highly on the first factor, gave high scores to 
drawings that looked like a person or in some 
cases that looked like the cartoon character 
or caricature that the subject seemed to be 
trying to portray. The artist, on the other 
hand, was primarily concerned with balance, 
symmetry, freedom of expression, and esthetic 
appeal. Thus, the first factor, called overall 
quality of drawing, seems mainly to reflect 
the technical skill of the subject in executing 
a drawing and has little to do with esthetic 
appeal. Picasso would score very low on the 
first factor. 

Investigators who have set out to develop 
measures of such aspects of drawings as ad- 
justment, maturity, intelligence, sex differen- 
tiation, body image disturbance, and artistic 
quality have ended up with measures whose 
major. differences are in their titles. Appar- 
ently, overall quality is such a pervasive as- 
pect of the drawing and has such a large vari- 
ance compared with other factors affecting 
drawings that measures have most of 
their reliable variance accounted for by this 
factor. Even a number of very specific as- 
pects of drawings such as Lack of Body Part, 
Crude Clothing, Lack of Margins and Lines, 
and Figure off correlate with this 


most 


Balance 


factor of overall quality almost to the limit 
of their reliability. 
It is surprising that overall quality of draw- 


ing is not related to gross adjustment in the 
VA group, since the Adjustment scale has its 
major loading on this factor. Yet gross ad- 
justment has very little communality (.23) 
with the drawing scores at all. Although Lack 
of Breasts on the Female Figure and perhaps 
other aspects of sex differentiation have a low 
relationship with gross adjustment, they can- 
not be used as measures of this variable be- 
cause “lack of breasts” and the Sex Differen- 
tiation scale have their major loadings on 
other factors unrelated to gross adjustment. 





Factor ANAL 


The present analysis indicates that overall 
quality of drawing accounts for most of the 
variance in aspects of figure drawings consid- 
ered clinically significant. It also seems prob- 
able that the overall quality of a person’s 
drawings has little relationship to his psy- 
chological adjustment. If there are, indeed, 
aspects of figure drawings which are related 
to the personality of the person making the 
drawing, such a relationship is likely to be 
obscured in any group comparison by the 
large individual differences in overall quality. 
Thus, further research might profitably be di- 
rected to study of the correlates of certain 
drawing scores when overall quality of draw- 
ing is experimentally controlled 


SUMMARY 


\ factor analysis of a number of aspects 
of human figure drawings including global 
evaluations of the whole drawing and scores 
of particular aspects of the drawing yielded 
a single factor which accounts for most of the 
common variance among the drawing scores 
in a heterogeneous group of subjects. Another 
factor analysis of a smaller number of draw 

ing scores using 90 VA patients as subjects 
yielded the same large factor as the first 
analysis and three smaller, yet interpretable, 
factors. The major factor, interpreted as over- 
) 


all quality of drawing, was unrelated to gross 


ysis or DAP 161 


adjustment of the VA patients. Gross Adjust- 

ment defined a factor of its own on which 

the Sex Differentiation scale and Absence of 

Breasts in the Female Figure had low loadings. 
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AN EXAMINATION OF THE ROLE OF THE EXPERIMENTALLY 
DETERMINED RESPONSE SET IN EVALUATING 
EDWARDS’ SOCIAL DESIRABILITY SCALE’ 


JERALD N. WALKER ? 


University of Washington 


Wiggins (1959) has attempted to assess the 
relative efficiency of several announced meas- 
ures of the social desirability variable, in- 
cluding Edwards’ SD scale (1957). He has 
presented evidence which he interprets as 
calling into question the efficiency of the SD 
scale as a measure of the tendency to respond 
to self-report questionnaires in the socially 
desirable way. 

Wiggins’ (1959) procedure was to have 
two independent groups take the MMPI un- 
der different instructions. The control group 
given the standard instructions indi- 
cated on the test booklet. The experimental 
group was instructed to responses to 
the individual items which people in general 
would consider more desirable, independent 
of whether the statement was true or false as 
applied to the subject. He then compared 
these two groups in terms of the various re- 
determine which scale 
best differentiated between the control group 
and the experimental group. He held that the 
degree to which each scale differentiated be- 
tween these two groups indicated its efficiency 
as a measure of the tendency to respond in 
the socially desirable way. 

A primary criticism of his approach is that 
the SD scale was developed to measur the 
tendency to respond in the socially desirable 


was 


give 


sponse set scales to 
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with 


conditions of test- 
ing, and not to assess the degree of response 
change under experimental instructions (Ed- 
wards, 1957). In a later article (Edwards, 
1961), the point is reiterated: “. the sim- 
plest statement to be made is that it [the SD 
scale] provides a measure of the tendency of 
; 


subjects to give socially de 


direction under standard 


irable responses 


in self-description under the standard instruc- 


h pe rsonality inven- 


tions ordinarily used wit 
tories” (p. 353). 
(1959) in- 


consider 


A second criticism of Wiggins’ 
terpretation is the failure to ade- 
quately the reasonable possibility that social 
desirability was operative 


during the stand- 
To the degree that social 
desirability was operative under standard con- 
ditions it would not be expected that a dras- 


ard administration. 


tic change in the socially desirable direction 
would ensue upon specific instructions to so 
respond. The high initial mean would restrict 
the amount of subsequent change. Further, 
the items of the SD scale are fairly obvious 
in terms of social desirability-undesirability 
It seems reasonable that such would tend to 
elevate the control mean of the SD scale. 

Finally, there is reason to believe that Wig- 
gins’ (1959) experimental 
such that they did not 
of responding in the socially desirable way. 
Evidence for this lies fact that Wiggins 
failed to find a significant difference for men 
in the K scale of the MMPI. Rosen (1956) 
and Voas (1958), using procedures similar to 
Wiggins’ (1959), did find such a difference 
If Wiggins’ subjects failed to understand fully 
his experimental instructions 
the K and SD scales 
mized. 


instructions were 
iximize the amount 


in the 


differences in 
may have been mini 
For the above reasons, it is felt that the 
data and interpretations presented by Wig- 
gins (1959) are not adequate to evaluate the 


SD scale as a measure of the tendency to re- 
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spond to self-description items in the socially 
desirable way under standard instructions. 
STATEMENT OF 


THE PROBLEM 


The purpose of this study was to assess the 
degree and source of change in SD 
from standard instructions (control) to in- 
structions intended to maximize the amount 
of responding in the socially desirable way 
(experimental). The experiment was designed 
to allow a clarification of Wiggins’ (1959) 
results and to test hypotheses based on al 
ternative explanations of his data. 

Wiggins’ failure to demonstrate substantial 
differences in the K and SD scales may have 
been due to the failure of his subjects to un- 
derstand fully the experimental instructions 
Consequently, a concerted attempt was made 
to provide clear, explicit instructions for the 
experimental condition. Further, it was hoped 
that the amount of responding in the socially 
desirable way would also be maximized by 
the use of instructions closely similar to those 


scores 


used by Edwards in the original development 
of the SD scale. 


Predictions 


It was predicted that shifts in the K and 
SD scales, greater than those in Wiggins’ 
(1959) study would be demonstrated 

It was also predicted that the amount of 
change in SD scores from control to experi- 
mental conditions would vary inversely with 
control SD scores. This is because Low SD 
subjects, in relation to High SD subjects, 
were less likely to be responding, originally, 
in terms of social desirability. The amount of 
possible change, therefore, would be greatest 
for the originally Low SD subjects 


METHOD 


The MMPI was the basic instrument from which 
scores on the SD scale and other scales 
rived. The keys for the K and SD 
found in Dahlstrom and Welsh (1960) 
scale is described in Wiggins (1959). 

The sample consisted of 120 male college students 
from the Western Washington College of Education 
it Bellingham, Washington 

All subjects took the MMPI under both control 
and experimental conditions, in that order, with 
week between testing 


were de- 
scales will be 
Wiggins’ Sd 


approximately 1 intervening 


sessions. 
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DESIRABILITY 163 
condition were 
standard MMPI 


the experimental 


The instructions for the control 
those indicated on the front of the 
test booklet. The instructions for 


condition were as follows 


Last time when we gave you the MMPI, wi 
asked you to respond to the items honestly and to 
tell us whether each item was True or False a 
applied to you. Today, we how well 
you can do in answering the same 


want to se 
items so as to 
impression you can. In 
other words, we want you to answer each item in 
consider the True or th 
False response to be more desirable. If you think 
the Truc than the False 
then answer the item True. If you think the Falsé 
response is more desirable than the True, then ar 
swer the item False. For example, if 1 

ing to create a favorable impressi 


create the most favorable 


terms of whether you 


response is more desirable 


probably answer True to this iten 
I am regarded as an hor 

and you would probably answ 
I frequently hurt the feeling 
Remember, answer each it 

to create the most 

sion possible 


favorable 


For purposes of part of the analysis, the 
split into upper, middle, and low 

the basis of the control SD score cutting points 
at SD scores of 28 and 33. Subjects with scor 
falling at i 


subject 
were thirds on 
were 
these two cutting } 
assigned to the adjacent categorie 
group contained 40 subjects 

In summary, the design of the present study dif 
fered from that of Wiggins (1959) 
tant respects: the present use 
for both control 


posed to the use of independent 


in three 
of the 


and experimental conditions, as op 


} 


impor 
same subjects 
groups; analysis of 
the amount of change from control to experimental 
conditions for three separate groups, as 
Wiggins’ procedure where thi 
done; 


opposed to 
breakdown was not 
of experimental instruc- 
closely approximate 
those used in the original development of Edwards’ 
SD scale. The use of the same subjects allowed the 
analysis by separate groups which, in turn, permitted 
the determination of the source of the 


and the present use 


tions intentionally designed to 


greatest 
umount of change from control to experimental con 
ditions 
RESULTS AND DISCUSSION 

The means and variances of the SD, K, and 
Wiggins’ Sd scales for control and experi- 
mental conditions are presented in Table 1 
In all cases, NV = 120 males 

Wiggins’ (1959) data for men indicated a 
mean difference of — 1.52, ¢ = 2.20, p < .05, 
on SD, a mean difference of .52, ¢ 76 on 
K, which is not significant, and a mean dif- 
ference of — 12.02, f 17.89, p< .001 for 
his Sd scale. The present mean difference on 
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TABLE 1 

MEANS AND VARIANCES OF THE SD SCALE, 
(1959) Sd ScaALE, AND THE MMPI 
CONTROL AND EXPERIMENTAI 


WIGGINS’ 
K SCALE FOR 
CONDITIONS 


Condition SD K 


Control iT | 30.48 
o* 26.39 18.98 
Experimental ul 36.06 10.96 
a 16.91 0.46 
Control —Experimental D 5.58 5.98 
t 10.33* 13.00* 


14.98 


*>=.01 


SD was — 5.58, t = 10.33, p < .01, the pres- 
ent mean difference on K was 5.98, ¢-: 
13.00, p < .01; and the present mean differ- 
ence on Wiggins’ Sd scale was — 13.00, ¢ 
18.84, p < .01. 

This demonstrates that the present experi- 
mental instruction were effective in produc- 
ing a set to respond in the socially desirable 
way, and, apparently, more so than in the 
case of Wiggins’ study. Thus, the prediction 
of a shift in K and SD greater than that in 
Wiggins’ study was substantiated. 

It was further predicted that the amount of 
change would vary inversely with the original 
SD scores. This was examined by dividing 
the subjects into high, middle, and low con- 
trol SD groups and observing the amount of 
change in their SD scores from control to ex- 
perimental conditions. It was predicted that 
the amount of change would increase in the 
order of high, middle, and low SD groups. 
This prediction was confirmed. Table 2 pre- 
sents the SD data for each of these groups 
separately. In all cases, VN = 40. 

The summary table for the analysis of 
variance of these difference scores is found in 
Table 3. 

The F in Table 3 is significant and indi- 
that the groups differ in amount of 
change from control to experimental condi- 
tions. The mean differences as presented in 
Table 2 follow the predicted order of change. 
These results clearly show that the amount 
of change from control to experimental con 
related to original SD 
scores. They further suggest that to the ex- 
tent that individuais are responding on the 
basis of social desirability, as measured by 
the SD scale, little change would be expected 


cates 


ditions is inversely 
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TABLE 2 


CONTROL AND EXPERIMENTAL DatTA 


FOR Hicu, Mmp.e, anp Low C 


SD SCALE 
SD Groups 


ON THE 
ONTROI 


Condition Middle Low 


Control 30.92 24.78 
2.3 14.02 
34.82 
25.22 


10.04 


I xperiment 36.35 


S50 


5.43 


when they are explicitly instructed to respond 
on that basis. Furthermore, it would seem 
reasonable that the same considerations would 
hold true for other MMPI scales. That is, to 
the extent that a scale reflects the social de- 
sirability response set, and to the extent to 
which subjects respond on the basis of social 
desirability, little change in scores would be 
expected when subjects are explicitly 
structed to respond on that basis. 

As indicated in Table 2, the variances on 
the SD scale for all three groups are larger in 
the experimental condition than in the con- 
trol. The difference between the control and 
experimental variances is significant for the 
high group (¢ = 6.53, df = 38, p< .01) and 
middle group (¢ = 4.43, df = 38, p< .01) 
but not for the low group (¢ = 1.83, df = 38, 
p > .0S). To a certain extent, these discrep- 
ancies between the control and experimental 
variances would be expected in that the man- 
ner in which the three groups were selected 
would tend to maximize homogeneity of SD 
scores within each group for the control con- 
dition. Although these increased variances 
may reflect less than perfect consensus as to 


in- 


TABLI 


VARIANCE O} 
RIMENTAL) 
MiIppLe, AND Low 


SD Grout 


THE ANALY OI 
CONTROI 


Hicn, 


RESULTS OI 


SCORES 


(HANG 
FOR THI 
CONTROI 


Source 


Between groups 
Within grou 
Total 


** > <.005. 
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TABLE 4 


Most 


Experimental records 


Proportion 


Cutting callec 


score control 


37 or above 
36 or above 
Wiggins 

36 or above 


what constitutes a socially desirable response 
it should be pointed out that, for the entire 
group, the scores on the SD scale were more 
homogeneous under the experimental than un- 
der the control condition. The variances for 
the control and experimental conditions, 
dicated in Table 

2.49, df = 118, p < .05) 

The use of the same subjects for both con- 
trol and experimental conditions forces a con- 
sideration of the possibility of a simple regres- 
sion effect on SD scores. On this basis, it 
would be expected that the high group would 
shift in a downward direction, the middle 
group would remain approximately the same, 
and the low group would shift in an upward 
direction in the experimental condition. How 
ever, the results, as shown in Table indi 
cate that shifts for all three groups were in 
an upward direction. Only in the case of the 
low group would a regression effect augment 
the predicted experimental effect. However, it 
should be noted that a effect, 
which would have minimized the increase for 
the high group, and augmented the increase 
for the low group, would have operated in a 
manner favorable to the hypothesis that the 
greatest amount of change would be for the 
low group and the least amount of change 
for the high group. In this sense, it might be 


in- 
1, differed significantly (¢ 


regression 


argued that the effect of the experimental con 
dition may have been the same for all three 
groups. 

In the present study, that cutting point for 
SD minimized both 
false positives and false negatives in differ- 


Edwards’ scale which 


Proportion 
called 


63 


6 


/ 
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EFFICIENT CUTTING PoINTs FOR DIFFERENTIATING CONTROL FROM EXPERIMENTAI 
RECORDS ON THE Basis oF Epwarps’ 


SD SCALE 


Control records 


Proportion 
] 


} 
1€a 


cal 


experimental 


entiating the control from the experimental 
groups was determined. Wiggins (1959) found 
that the most efficient cutting point for Ed- 
wards’ SD scale was 36 or above. His data 
are reproduced in Table 4, as are the data 
from the present study concerning the cut- 
ting points for Edwards’ SD scale. In the 
present case, it was found that the most effi- 
cient cutting point was 37 or above. As indi- 
cated in Table 4, by using either a cutting 
point of 37 or above or 36 or above, the con- 
trol and experimental records are differenti- 
ated better in the present study than in Wig- 
war 

gins’. 

It seems likely that the present improve- 
ment in differentiation over Wiggins’ study is 
due to the greater efficiency of the experi 
mental instructions in the present study in 
establishing a uniform set to respond in the 
socially desirable way, since it will be noted 
that the improvement in predictive efficiency 
is primarily in the experimental records. 


SUMMARY 


This study was concerned with the source 
and degree of change in scores on Edwards’ 
SD scale as a function of experimental in- 
structions. The MMPI was 
120 male college students. 


administered 
On the 
first administration, the standard instructions 
were used. Approximately 1 week later, the 
same subjects were ag the MMPI 
with maximize the 
amount of responding in the socially desir- 
able way. Significant shifts in scores were 


twice to 


ain given 


instructions intended to 
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found for Edwards’ SD scale, the K scale of 
the MMPI, and Wiggins’ Sd scale. It was 
predicted that the amount of change in SD 
scores would vary inversely with control SD 
scores. The subjects were divided into high, 
middle, and low groups on the basis of their 
control SD scores. It was found that the least 
amount of change occurred in the high group, 
the next greatest amount the 
middle group, and the of 
change in the low group. 


of change in 
greatest amount 

The efficiency of the SD scale in differenti- 
ating control from experimental records was 
examined. An improvement in differentiation 
was observed in this study over that ob- 
tained by Wiggins. This was attributed to the 
facility of the present experimental instruc- 


tions in maximizing the amount of responding 


in the socially desirable way. 
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PERCEPTUAL IMMATURITY AND ACQUIESCENCE 
AMONG VARIOUS DEVELOPMENTAL LEVELS 


L. VAN 


pE CASTLI 


ty of Denver 


in- 
terest in developmental studies of perception 
(Wohlwill, 1960). An outcome of this inter- 
est has been the attempts to develop system- 
atic methods for the comparative study of 
normal and deviant perceptual responses. Uti- 
lizing Werner’s (1948) general developmental 
theory, a series of studies has been carried 
out where the attempt has been made to com- 
pare Rorschach levels of “genetic maturity” 
for various normal and psychopathological 
groups. A good summary of these studies is 
provided by Phillips, Kaden, and Waldman 
(1959). In general, the findings indicate that 
the more similar an adult Rorschach protocol 
is to that of a young child’s protocol, the 
more severe is the behavior pathology to be 
expected. 


There has recently been an increased 


Stemming from a different theoretical ori- 
entation, that of Berg’s (1955) deviation hy- 
pothesis, several studies have investigated the 
deviant responses of various psychopatho- 
logical groups to a series of abstract designs 
(Berg, 1959). In a study by Roitzsch and 
Berg (1959), it was found that the percep- 
tual choices of normal adolescents were simi- 
lar to those of neurotic patients but that the 
choices of both groups were significantly dif- 
ferent from those of normal adults. In an ear- 
lier study, Hesterly and Berg (1958) found 
that the perceptual choices of young normal 
children were similar to those of schizophrenic 
patients but that the choices of both groups 
were significantly different from those of nor- 
mal adults. 

If the results of these latter studies could 
be confirmed by other investigators, it should 
offer a promising means of constructing a 
scale of perceptual maturity based upon re- 
sponses to abstract designs. Such a scale could 
be more easily administered and scored than 
the Rorschach, and yet lend itself to the 
same sort of research problems investigated 
by the Rorschach genetic maturity approach. 


The purpose of the present study, therefore, 
was to discover if the results of Hesterly and 
Berg (1958) could be independently 
firmed. 

In their study, Hesterly and Berg adminis- 
tered the Perceptual Reaction Test (Berg, 
Hunt, & Barnes, 1949) to groups of ele- 
mentary school children, aged 7-12, and com 
pared their scores on Barnes’ (1955) Schizo- 
phrenia scale with the results of normal adults 
and schizophrenic adults on the same scale 
The Perceptual Reaction Test consists of a 
series of 60 colored abstract designs drawn 
with a ruler and compass and the subject 
selects one of the following four choices for 
each design: like much, like slightly, dislike 
slightly, or dislike much. Several modifica- 
tions of their procedure were made in the 
present study which will be described below. 


con- 


PROCEDURE 
The testing materials for this study 
a short form of the Welsh Figure Preference Test 
(WFPT) (Welsh, 1959) form consists of 144 
designs and the subject indicates whether he likes or 
dislikes each design. Some of the designs are drawn 
with a and ruler and some are free hand 
drawings. Since the test had not yet been made com- 
mercially available, when the present study was con- 
ducted it to prepare purple dittoed 
copies of than use the original 


consisted of 


Th 
nis 


compass 


was 
the rather 
black and white designs 

The WFPT small groups to 
second and third grade elementary school pupils in 
Pocatello, Idaho! and to Idaho State College stu 
dents. No elementary pupils were used if they were 


over 10 years of age 


necessary 


designs 


was administered in 


The Schizophrenia scale of the 


1 The author would like to gratefully acknowledge 
the assistance of the following persons in data col- 
lection: Evelyn Thurkill, coordinator of the Poca 
tello Public Schools; Richard Lake and Fred de 
Witt, psychologists at Idaho State Hospital South; 
Margaret Harris, Mo, Frank Pabian, and 
Cy Shepardson, students at Idaho State College 
Oscar Eichhorn, Russell Allen, and Mary Beauchamp 
assisted in data tabulation. Financial support was 
given through a University of Denver Faculty Re- 
search Grant. 
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MMPI had been previously administered to the col 
lege students and in order to obtain “normal” adults, 
only those receiving J scores below 65 on the Schizo- 
phrenia scale were included in the original sample of 
college students. Also, WFPT records were not used 
if 90% or more of the responses utilized only one 
consistent choice, i.e., checking 90% like or 90% dis- 
like. The number of subjects meeting the above three 
criteria included 200 elementary pupils and 200 col- 
lege students. 

An item analysis of the WFPT was then carried 
out upon the responses of these two groups. Using 
the CR technique for differences in proportions, 94 
out of the 144 items were found to differentiate be- 
tween the two groups at the .05 level of significance. 
Differential weights were assigned to these items in 
proportion to their discriminative power. These items 
were combined into a scale called the Perceptual 
Maturity Scale (PMS), which ranged in theoretical 
value from scores of 0 to 167, with high scores in- 
dicating greater maturity. 

The WFPT was then administered to new groups 
of elementary school children, to inpatients carrying 
a primary diagnosis of schizophrenia at Idaho State 
Hospital South who were judged to be cooperative 
for testing purposes, and to new groups of Idaho 
State College students. These various cross-valida- 
tion groups were then scored upon the 94-item PMS. 


RESULTS 


The PMS scores of the various cross-valida- 
tion groups are shown in Table 1 under the 
heading “Original PMS.” The elementary 
school children’s scores have been arranged 
according to the same age groupings reported 
in the Hesterly and Berg (1958) article. 
Similar to their results with the Barnes’ 
Schizophrenia scale, it was found in this study 
that the adult schizophrenic group obtained 
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PMS scores midway between those of nor 
mal adults and normal young children. 

Of course, it is possible that the schizo 
phrenic group may have occupied a midway 
scoring position in this study merely as an 
artifact of not having been considered in the 
original item analysis. Thus any sample of 
subjects of a type different from those used 
in keying the items would be expected to 
stand intermediate between samples repre- 
sentative of the original high group and the 
original low group. However, if this rationale 
held, one would have expected quite differ- 
ent results than those actually obtained by 
Hesterly and Berg. These investigators used 
Barnes’ (1955) Schizophrenia (Sigma) scale 
which was derived from contrasting the re- 
sponses of adult schizophrenic and normal 
groups and then later evaluated elementary 
pupils on this same scale. Applying the ra- 
tionale above, it would be predicted that chil- 
dren should obtain Schizophrenia scores inter- 
mediate between those of adult schizophren- 
ics and normal subjects, since no children 
groups were used in the original item analy- 
ses. The results found by Hesterly and Berg 
(1958) indicated that although there was 
no significant difference between the Schizo- 
phrenia scores of elementary pupils and adult 
schizophrenics, the children actually obtained 
scores below those of schizophrenics. 

In their study, Hesterly and Berg (1958) 
reported that the difference between the scores 
of their combined elementary school children 


TABLE 1 


Scores OF Cross-VALIDATION GROUPS 


Original PMS 


SD CR 


Group N M 


61.20 
68.68 
72.11 
67.91 
78.53 
103.81 2 


7- 8-year-olds 

9- 10-year-olds 
11-12- year-olds 
Combined pupils 
Schizophrenics 
College students 


2.48" 


3.62» 
8.23¢ 
15.614 


Dislike responses Revised PMS 


M S CR M 


CR 
57.28 
60.53 
58.60 
59.31 
63.47 
69.98 


17.30 
18.60 
20.02 
18.68 
20.35 


27.43 


Yo Nd ww LY 
Son oe Ul 
Ne Oma 
ee 


Note.—CRs in italics represent comparisons between groups with significant difference 


® Based upon comparison between 7-8-year-olds and 11-12-year-olds. 

> Based upon comparison between combined pupils and schizophrenics. 
® Based upon comparison between schizophrenics and college students. 

4 Based upon comparison between combined pupils and college students. 
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and their groups of adult shizophrenics was 
not significant using the ,? median test.* In 
the present study, the more sensitive CR tech- 
nique was used to evaluate the difference be 
tween mean scores. The resulting CR of 3.62 
between combined elementary school children 
and adult schizophrenics on the original PMS 
is significant beyond the .001 level. The dif- 
ference in mean original PMS scores between 
the adult schizophrenics and college students 
is also significant well beyond the .001 level 
(CR = 8.23). Within the college group, no 
significant differences were found as a func- 
tion of age, sex, or score on the MMPI Schizo- 
phrenia scale although students with higher 
Schizophrenia scores did obtain lower PMS 
scores. Within the elementary school group, 
PMS scores increased significantly in relation- 
ship to chronological age despite the fact that 
the differences in mean original PMS scores 
were probably minimized because extremely 
few 7-year-old and 12-year-old pupils were 
included in their respective groups. Hesterly 
and Berg also reported a relationship between 
pupil’s chronological age and scores on the 
Barnes’ Schizophrenia scale. 

In inspecting the original PMS it was noted 
that 74 of the 94 differentiating items were 
keyed in the dislike direction. Consequently, 
if a subject checked all 144 items in the like 
direction, he would receive a PMS score of 
34, and if he checked all 144 items dislike he 
would receive a PMS score of 133. This ob- 
viously suggested that response bias and origi- 
nal PMS scores might be too closely interre- 
lated. A sample of 100 cross-validation rec- 
ords was therefore drawn in such a way that 
the three groups of children, patients, and 
college students were about equally repre- 
sented and a comparison made between the 
original PMS scores and the number of dis- 
like responses given to the 144 items. The 
resulting correlation was found to be .784, 
which is significant well beyond the .001 level. 

The importance of response bias is also 
clearly shown in Table 1 by inspecting the 
column headed “Dislike responses,” which is 


2 The difference was even less significant than that 
reported by Hesterly and Berg as there is an arith- 
matic error in Table 3 of their article. The combined 
number of children above and below the median 
should total to 75 and 75 rather than 65 and 85. 
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a tabulation of the total number of dislike 
responses given to the 144 items. It can be 
seen that children gave the fewest dislike re 
sponses, followed in order by schizophrenics 
and then college students. The difference be- 
tween the mean number of dislike responses 
checked by children and college students is 
significant well beyond the .001 level (CR 

5.10). The tendency toward response bias 
was so pronounced among children and schizo- 
phrenics that approximately 7% of children’s 
records and 18% of schizophrenic records 
were not scored because they had over 90% 
like responses checked. Approximately 2% of 
children’s and schizophrenic’s records were re- 
jected because they included over 90% dis- 
like responses. Only one college student rec- 
ord was rejected for scoring purposes because 
of showing extreme response bias 

In order to construct a scale that would be 
free of response bias, the 20 like items of the 
original PMS were used along with the 20 
most differentiating dislike items. Each of 
these items was weighted one point and the 
cross-validation groups were then scored on 
this revised PMS.* The results with the re- 
vised PMS, shown in Table 1, reveal that the 
same relative ranks shown with the original 
PMS are preserved with this shorter form 
and that three of the four CRs are actually 
larger in magnitude with the revised PMS. 
A correlation between the revised PMS and 
number of dislikes was computed on the same 
100 records used for the previous correla- 
tional analysis and was found to be —.177, a 
value not significantly different from zero. 


DISCUSSION 


The results of the present study are in close 
agreement with those reported by Hesterly 
and Berg (1958). The main discrepancy in 
findings was that a significant difference was 
found in the present study between the com- 
bined scores of elementary school children 


3 The following revised PMS items are listed ac- 
cording to the numbers assigned in the present edi- 
tion of the Welsh Figure Preference Test. The 20 
like items are: 7, 23, 26, 71, 87, 117, 143, 159, 202, 
205, 235, 240, , 279, 315, 322, 357, 358, 399, 400. 
The 20 dislike items are: 14, 20, 42, 54, 79, 85, 120, 
121, 122, 188, 222, 228, 230, 286, 295, 296, 329, 334, 


352, 398. 
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and schizophrenics on both PMS measures, 
whereas Hesterly and Berg reported a non- 
significant difference for this same compari- 
son. This discrepancy may have arisen be- 
cause a more sensitive statistical evaluation 
was used in the present study. Both studies 
found a progressive relationship between 
chronological age among young school pupils 
and scores on the PMS and Schizophrenia 
scales, with the scores of the schizophrenic 
group being very similar to that of 11- and 
12-year-old children. It was also found in 
both studies that young children and adult 
schizophrenics obtained scores which differed 
from normal adult scores to a highly signifi- 
cant degree. These similarities of scoring pat- 
terns were found despite the obvious differ- 
ences between the two studies in the number 
and nature of the stimuli employed, the num- 
ber of response categories available for these 
stimuli, possible sample differences (particu- 
larly in the selection of schizophrenics and 
normal groups selected), and differences in 
the method of empirically deriving the test- 
ing scales. 

Both the earlier study and our result with 
the 94-item scale, could be interpreted as 


offering support for Berg’s deviation hypothe- 
sis (1955). Berg feels that the content of 
test items is relatively unimportant and that 
attention should be focused upon the style of 
responding. For this purpose, Berg (1959) 


claims “. it should be possible to use a 
series of abstract designs for test items and 
do about as good a job of measuring person- 
ality as can be done with traditional verbal 
items” (p. 89). According to Berg (1959), 
“What is needed are stimuli that will elicit 
deviant responses or, more accurately, stimuli 
which will produce ‘response sets’ or biases 
from which deviant response patterns may be 
statistically identified” (pp. 88-89). 
Confirmation for the importance of re- 
sponse bias was found in the present study 
as there was a highly significant difference 
between the total number of dislike choices 
given by children and college students, and 
although the reported difference in dislike 
choices between adult schizophrenics and nor- 
mals was not quite significant (CR = 1.87), 
it would have been significant if account were 
taken of the large number of schizophrenic 
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records which were not scored because over 
90% of the responses were checked “like.” A 
similar form of response bias also contributed 
significantly to the findings of Hesterly and 
Berg (1958) with the Barnes’ Schizophrenia 
scale. In his study, Barnes (1955) found that 
in comparison to normals, schizophrenics had 
a much greater tendency to choose like al- 
ternatives, so he constructed his scale to spe- 
cifically capitalize upon such differences in 
response bias. As an example of how different 
these two groups can be in response style, a 
recent article by Adams and Berg (1961) in- 
dicates that while normals prefer like and dis- 
like options to about an equal degree, schizo- 
phrenics averaged approximately 42 like to 
only 18 dislike responses on the Perceptual 
Reaction Test. 

It would be inaccurate, however, to claim 
that the results of the present study can be 
explained solely on the basis of response bias, 
because highly significant differences were 
found between the various groups on the re- 
vised PMS which had been specifically con- 
structed to eliminate the effect of response 
bias. This raises the possibility that there may 
be definite differences existing between the 
types of perceptual figures preferred by vari- 
ous developmental groups. An inspection of 
the designs which differentiated between chil- 
dren and adults suggests that children gen- 
erally prefer simple, symmetrical, geometric 
figures that are placed in the center of the 
available space while adults prefer more 
differentiated, heavily shaded, asymmetrical, 
complex designs. 

This preference for more differentiated, com- 
plex perceptual. patterns by developmentally 
higher subjects has also been found in the 
various Rorschach studies comparing the ge- 
netic maturity of various developmental 
groups (Phillips, Kaden, & Wadman, 1959). 
Some hope is thus offered that it may be pos- 
sible to devise a short, easily administered and 
scored, objective test of perceptual preference 
which should lend itself to the same type 
of research problems now being investigated 
by the Rorschach genetic maturity approach. 
The author is currently engaged in further re- 
search along these lines using a forced-choice 
technique with larger groups of subjects and 
attempting to assess the possible role of vari- 
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ables such as intelligence which were not pos- 
sible to evaluate in this initial investigation 


SUMMARY 


The present study was undertaken to dis- 
cover whether the findings of Hesterly and 
Berg (1958) could be 
firmed. An abbreviated 144-item form of the 
Welsh Figure Preference Test adminis- 
tered to 200 second and third grade elemen- 
tary pupils and 200 college students with low 
scores on the Schizophrenia 
MMPI. Those items significant 
difference between these two groups were in- 


independently con- 
was 
scale of the 
showing a 
corporated into a Perceptual Maturity Scale 


(PMS). 
school pupils, 


Several groups of new elementary 
students, and schizo 
patients totaling 487 subjects were 
then scored upon the PMS. Ranking for thes« 
groups in PMS 


vear-olds, 11—12-year 


college 
phrenic 
terms of increasing scores 
was 7—8-year-olds, 9-1 
olds, adult schizophrenic patients, and nor 
mal adults. This order corresponded to that 
reported in the Hesterly and Berg 
cept that they 


in scores 


study, e€x- 
reported that the difference 


between the combined elementary 
children and 


significant 


<< hool schizophrenic 


patients 
was not whereas the same con 
parison produced a significant difference in 
this study when evaluated by a 
tive statistical technique. It that 
a significant of .78 existed be- 
tween PMS scores and the total 


A re vised 


more sensi- 


was found 
correlatior 
number of 


dislike re sponses PMS constructed 
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to eliminate the influence of bias 
produced exactly the ranking of the 
various groups but to a more significant de 
These results with re- 
gard to Berg 

the Rorschach genetic maturity approach 


response 
same 


gree. were discussed 


’s deviation hypothesis and to 
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THE RELATIONSHIP OF 


DEPENDENT 
BEHAVIORS TO COUNSELOR STYLE 


LIKE VERBAL 
AND 


INDUCED SET! 


RONALD H 


ROTTSCHAFER ? ani 


GUY A. RI 


Southern Illinois University 


Dependency as a relevant characteristic of 
the focus of 
speculation in various writings, with little or 
no accompanying empirical investigation. In 
view of the fact that almost 20 years have 
now elapsed since Rogers (1942) first as- 
serted that dependency is increased by those 
who advocate a more leading, authoritative 
counselor role, this dearth of research seems 
strange. Rogers maintained, in this early chal- 
lenge, that the identification and emergence 
of the counselor as an “expert,” upon whom 
the client was to rely, would hamper the 
development of independent and self-reliant 
kinds of behavior in clients. He also pointed 
out that 


clients in counseling has been 


the leading-probing response too 


often led clients to attend to the counselor’s 


that is, to 
f 


rather than to their own; 
the counszlor had “in mind” for him- 
self or for the client. Rogers then argued for 
a more client-centered approach, defined by 
such operations as reflection of feeling, clari- 
fication, simple acknowledgment, and silence. 
Purportedly these categories of response would 
permit the client more freedom to mobilize 
himself for self-government. 

rhe authors discovered in their survey of 
the literature only the work of Guerney 
(1956) bearing on the influence of counselor 
style on client dependency. His definition of 
dependency is operationally reducible in most 
part to encompass the pertinent verbal be- 
haviors of clients. Guerney found no differ- 
ence in the frequency of such within counsel- 
ing, dependent behaviors in the clients of two 


data 
what 


senior 


1 Based upon a dissertation submitted bv the 
author to the Department of Psvcholog 


Illinois University in partia 
PhD deg 


the use 
style 
Snyder in hi 


Ford, and Guerney 


kinds of counselors, distinguished by 


of either a “leading or reflective” 


However, it was pointed out by 
introduction to Ashby 
(1957) that Guerney’s findings were possibly 
due to the assumption of a role by these 
to play either a 
Ap 
rressed, many of Guer 
toward thei: 
than to 
Other- 


summary of stud 


counselors, who isked 
leading or a 
parently, as time pr 


ney’s counselors | 


were 


reflective type interactor 


ined more 


typical and preferred style, rather 


the role they were supposed to play. 


wise, Rottschafer’s 
| 


ies on the effect of counselor stvle on proces 


+ 


variables and/or client characteristics has re 


vealed some contradictory results. Generally, 
investigators have found that reflective coun 


selors foster more responsibility, better judg 
their clients 
but 


t particularly favor 


ment. and less defensiveness in 
than 
postcontact reports do n 


one or the other 


. . a 
do the leading counselors chents 


Client expectations lso assumed to be 
crucial intrapersonal for 
hibiting the process of 

or changes in clients. Dx 
utterances licited and 
petuated by the anticipation that 
haviors will be reinforced. Grant 

McGowan (1954) are 
studies attesting to th 
certain expectations 

way the process will | n an 


facilitating or i 
ounseling itself and 
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pendent-like verbal 
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parently | 
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from these descriptive and conceptual reports, 
only a few scattered attempts have been made 
to determine the influence of experimentally 
manipulated set on client characteristics. 
Holman (1955) and Barahal, Brammer, and 
Shostrom (1950) have reported that pre- 
counseling instructions effected the set of 
their clients and certain postcounseling meas- 
ures. Biddle (1954), in addition, demon- 
strated that inducing observers to expect a 
certain kind of process, either problem cen- 
tered or relationship centered, influenced their 
subjective report of a role played counseling 
session. Although Biddle’s attempts at set in- 
duction were apparently successful, there is 
still little data to conclude that clients can 
be influenced by the set or instructions given 
them before their sessions begin. 

Since the role playing aspects of previous 
investigations lead 
sions, this study was designed to use actual 
counseling contacts conducted by counselors 
employing their typical style. Specifically, the 
purposes of this investigation were to explore 


to questionable conclu- 


further in the initial interview the following: 
1. The relationship of counselor style to 
the frequency of dependent-like statements 
by clients in the process; 
2. The relationship of 
struction to the frequency of 


precounseling in- 
these depend- 
ency statements; 

3. The interaction between the independent 
variables, counselor style, and induced set, on 
the dependent variable, client dependency 


METHOD 


Subjects. Voluntary clients were selected from 10 
beginning classes in general psychology at Southern 
Illinois University. This selection was done after the 
Mooney Problem Check List administered to 
them in their classes. The were not told 
that it was a preliminary phase of a research proj- 
ect; instead, they were informed that it was an in- 
structional sample of available psychological instru- 
To each Problem Check List was attached a 
sheet on could 
indicate whether they desired to “talk things over” 
with a staff counselor. The sheet also instructed the 
subjects to check on a five-point scak 


was 


subjects 


ments 


mimeographed which the subjects 


the extent of 
their motivation for counseling. It was then assumed 
that any student checked either Category 5 
very interested, or Category 4, interested, on the 
question, who stated a preferred 
who identified himself by name and ad 


Ww ho 


motivation also 


time, and 


dress could be considered as motivated and 
what like a “self-referred” client. 

About 25 to 30% of every undergraduate class ap- 
proached indicated a desire for counseling. Of this 
group, some 80% appeared on time for their inter- 
view, or they called in to arrange for a new 
This indicated that the method used for 
voluntary clients was indeed effective. The total num 
ber of subjects so obtained was 41 


time 


selecting 


Set induction. A leading or a reflective set wa 
randomly induced by the experimenter a fi 
utes before the subjects’ first contact. The 
were given a typed sheet of instructions to read, one 
that attempted to conceptualize what might occur in 
the initial contact. Briefly, the material for a refle« 
tive structure related that most counselor 
clinic preferred to let the client talk alx 


min 
subject 


ever he felt was important, that the 

decisions or give advice, and that such 

preferred to turn over to the client tl : 
bility for what occurred in the contact. Contrari 


wise, the subjects assigned to a leading set were ir 
structed that most counselors preferred to operat 
much the same as did other people in 
authority. Thus, by reason of their prof 

ing and experience, most counselors at tl 
that they were probably in the best positior 
what the subjects’ problems were, and 


ssional 


gest the solution 

After allowing enough time for the material 
read, the experimenter came back into the r 
for approximately a minute paraphrased c 
forced what the had just read. Thus, bot! 
written verbal means used to 
set. Following this, the client 
counselor. 


Selection of 


client 


and were induce 


was introduced 


Eight cou 
Included in the gro 
either psychology or 


counselors 
teered for this study 
PhDs in 
five MA 


Counseling 


, lar 
guidaan 


three 
four in psychology and one in social 
experience for the group rang 
to 8 years, with the median at 6 years. Ir 
were all 
enced counselors by others in the 


competent and 
area. None 
counselors were told the nature of the experin 
they were instructed only to conduct thei: 
views in whatever way they preferred. There is 
reason to believe that 
fully aware of the research design while 
having these counseling contacts 


they considered as 


these counselors wert 


Initial procedure. Counselor style wa 
prior to the experiment proper by e\ 
successive, initial contacts conducted | 
selor on other clients than the subject 
A scale recently developed by Strupp 
used to identify 
styles. This 
apart from the interactive 
that is, 
context. A counselor was judged 
stvle if at least 65% of his respor 
ploratory operations, interpretatior 


and refl 
counsel: 


leading 
scale classifies 
neaning 


it classifies all responses 





174 RoNALD H. ROTTSCHAFER 
direct guidance (categories 10, 30, 40, and 50 on the 
Strupp scale). A reflective counselor was one whose 
responses were classified as facilitating communica- 
tion (e.g., simple acceptance, silence, etc.) and clari- 
fication and/or reflection of content or feeling (cate- 
gories 00 and 20 on the Strupp scale) at least 65% 
of the time. Of the eight counselors who volunteered 
their services, four were found to utilize a leading 
style and three a reflective style. One counselor was 
eliminated in the initial analysis because his style 
did not meet the 65% criterion. On the basis of such 
studies Seeman (1948), Dipboye (1954), Mc 
Gowan (1954), and Danskin and Robinson (1954), 
it was assumed that each counselor would remain 
relatively consistent in any remaining interviews with 
the style demonstrated in his two initial tapes. Clients 
were then randomly assigned to these counselors, and 
each interview was evaluated directly to 
determine the frequency of client within process de- 
pendency, as defined by Guerney.? Random assign 
ment of clients and set resulted in 21 initial contacts 
for leading counselors, 12 of which were given a lead- 
ing set and 9 a reflective set. Reflective counselors 
saw 15 clients, 8 were prepared for a leading coun- 
selor and 7 for a reflective one.* 


as 


recorded 


RESULTS 
Reliability of Classification 


Reliability of the experimenter’s classifica- 
tion of counselor style was estimated by 
comparing his ratings with those of two ad- 
vanced PhD candidates in psychology. A ran- 
dom sample of 20° of the tapes was used to 
obtain rank-order correlations for each client 


based on category-by-category rankings on 
the Strupp scale. The median rank-order cor- 
relation was .94 between the three judges, 
with a range from .76 to 1.00. While this esti- 
mate was. based essentially on comparative 


frequencies of response within categories, it 


8 Guerney (1956) defined dependency as 

the extent to which the client asks the therapist 

for his opinions, or advice, information, evalua- 

tion, instruction, or demonstrates a need for struc- 
turing from the therapist. Also, it includes the de- 
gree to which the client places responsibility for 
the progress or outcome of counseling on the coun- 
selor rather than accepting it himself 
Examples of client statements classified as dependent 
are “what would you do in a case life that?”, “I’m 
open for any suggestions.”, “do you think I ought 
to come back?” 

4 Leading counselors had their responses on pre 
experimental tapes classified as such more than two 
thirds of the time, and reflective counselors were like- 
wise classified as using reflective responses two-thirds 
or more of the time 
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was also noted that there was a high agree- 
ment between judges on categorizing the re- 
sponses proper. 

Likewise, two experienced counseling psy- 
chologists were used to reclassify a random 
sample of tapes on the frequency of depend- 
ent-like responses by the clients. The median 
tetrachoric estimate of reliability based on 
agreement and disagreement for both depend- 
ent and nondependent statements was .985, 
with a range of .94 to .99. These estimates 
were based on a response-by-response analy- 
sis, and the number of responses varied from 
52 to 95 for the interviews judged. Again, 
20% of the tapes were used as the sample. 
These findings support the observation that 
the classification schemes for both style and 
dependency were reliable 


Initial Analysis 


Since the contacts varied from 40 to 70 
minutes in length and because counselors were 
variable in the amount of talk they permitted 
their clients, it was deemed more meaningful 
to score this dependency on the basis of the 
proportion of dependent statements to total 
client of the 
sheer frequency of occurrence, as originally 
planned. 


responses, instead on basis of 


The results of the two-by-two analysis of 
variance, using Tsao’s method, on frequency 
of dependent-like statements are given in 
Table 1. None of the F values (based on pro- 
portions) were significant. It will be seen that 
this initial finding is somewhat at variance 
with that noted in a later analysis based on 
reclassification of counselors 
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TABLE 2 


CONSISTENCY OF COUNSELOR STYLE 
FROM CLIENT TO CLIENT 


Chi Contingency 


Counselor style square coefficient 


Leading counselors 
29 67** 
28.72** 
34.93 
69. 84*** 
Reflective counselo 
1 31 54** 


? 77.03*** 


60.50*** 


Counselor Consistency 


It may be recalled that an assumption basic 
to the study was that counselors would main- 
tain a similar style from contact to contact. 
Since a negative finding in this respect may 
negate the implications of the initial findings, 
each experimental contact was reanalyzed to 
classify the actual style used by counselors 
The reader is referred to Table 2 for the re- 
sults of this analysis. Significant chi squares 
indicate that six of the eight counselors were 
inconsistent in style. The reported contin- 
gency coefficients conveniently represent these 
chi squares in correlational form; the higher 
the value, the greater the inconsistency. In 
view of the fact that contingencies 
were about equal (.30 for leading and .31 for 
reflective counselors) 


median 


both previously classi- 
fied groups may be said to be equally as in 
consistent from tape to tape. These findings 
contrary to those reported earlier, indicated 
that the initial analysis did not properly re- 
flect the relationships between counselor style 
and frequency of dependent-like talk 


Reclassification of Styles 


When counselors were re-evaluated for style 
on each contact, it was found that there was 
not the dichotomy of leading versus reflective 
counselors, as previously supposed. Several 
counselors actually used a mixture of the two 


styles. Consequently, counselors were recate- 


rABLE 3 


MEAN PROPORTION OF DEPENDENT 
STATEMENTS FOR STYLE AND SET 


Percentage 
leading 
responses 


Induced set 


Leading Reflective 


67 100% 
VY = 10 


0762 0468 


34-607 


0361 0219 


0408 


gorized on a continuum of leading responses 
from 0 to 100%, and each was arbitrarily 
classified as low (0 to 33%), medium (34 to 
66%), and high (67 to 100%) in leading be 
Table 
cell for combinations of counselor style and 
set. This time a two-by-three analysis of vari- 


haviors 3 contains the mean for each 


ance was utilized based on two induced sets 


and three counselor styles. In this new two 


by-three factorial design, mean values 


] 


were 


added to each of the three cells under reflec- 


tive set to bring about proportional frequen- 
criterion 
leading 

aban- 
all eight counselors were now included 
in the analysis. The results are given in 
Table 4. The F values for both counselor style 
and induced set were significant at the .05 and 


cies. Furthermore, since the initial 


used to classify counselors as either 


or reflective (65° of responses) was 


doned 


.06 level respectively. Apparently the addition 


of the eighth counselor. whose clients received 


rABLE 4 


is OF VARIANCE OF CLIENT DEPENDEN 
UNSELOR StyLeE Was Det 


EXPERIMENTAL CONTACT 


ERMINED FI 


Mear 


006951 
006469 
000308 
001842 


Between 
Between st\ 
Interact 
Within 





176 RoNALD H. RotTscHAFER 
extreme scores, is the explanation for the sig- 
nificant variance now attributable to set, 
which the initial analysis failed to show. With 
due respect given to the risk involved, it was 
decided to reject the null hypothesis for set 
at that level. This method of classifying coun- 
selors in three groups revealed that both coun- 
selor style and precounseling instructions are 
apparently related to dependent-like talk by 
clients. The interaction between counselor 
style and precounseling set was found to be 
nonsignificant. 


DISCUSSION 


The finding that reflective counselors foster 
less dependent-like talk by clients than do 
leading counselors lends credence to what 
many Rogerians have for years maintained 
on theoretical and, often, experiential grounds. 
Too, precounseling instructions seemingly have 
some bearing on at least this kind of client 
characteristic. Such limited results must be 
supported, however, by further empirical in- 
vestigation before more definitive conclusions 
can be made. At present it is not even known 
whether a positive or negative value can be 
placed on the dependency that clients dis- 
play while in counseling. For instance, psy- 
choanalysis asserts that a client is made de- 
pendent so that he can become independent. 
This, of course, remains to be demonstrated 
empirically, as does the Rogerian belief that 
transference and other dependent-like coun- 
seling behaviors do not facilitate growth or 
maturity in general. In any event, it appears 
that precounseling instruction can exert some 
control over client dependency, regardless of 
counselor style and vice versa. 

If subsequent research bears out the belief 
that the frequent appearance of dependent- 
like statements in clients while in counseling 
is in fact a barrier to independent living, or 
associated with negatively valued activities, 
then consideration should be given to the im- 
plementation of some form of precounseling 
instruction to reinforce a less dependent ap- 
proach. It has been speculated that precoun- 
seling group meetings, designed to let pros- 
pective clients “work out” their private ex- 
pectations, may well serve to orient clients to 
the assumption of more responsibility in coun- 
seling. Another suggestion has been to lay the 
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groundwork for independence in the intake 
interview. The selection of a workable method 
or methods of preparing clients to assume in- 
dependence in counseling is necessarily de- 
pendent on the results of future research. 

In one way, the results of the present study 
may not be in complete agreement with the 
speculation that clients would actively resist 
any instructions and even counselor behav- 
iors which are at odds with their own pre- 
conception of the “authoritative” counseling 
situation. At least in the first contact, pre- 
counseling instructions are apparently acted 
upon in some way. It may well be that clients 
are only “playing along” with the counselor 
until they (the clients) get a better feel for 
the interaction 
attempt was made to 


pattern. Of course, since no 
assess the private ex- 
pectations of these clients, there are no data 
actually the extent to which these 
personal sets were suppressed or altered. Yet 
it does appear that clients attended to the in- 
structions, whether congruous with their own 
beliefs or not. 


to show 


SUMMARY 


Forty-one clients, college stu 
dents. were first given written and verbal in- 
struction to expect either a leading or reflec- 
tive counselor, and then they were randomly 
assigned to a counselor. who had been previ- 
ously classified as leading or reflective. Only 
the first contacts were evaluated and scored 
on the proportional frequency of dependent 
statements made by these clients. Analyses 
were then made of the relationship of this 
kind of client dependency to counselor style 
and precounseling instructions. 

It was concluded that there is a relation- 
ship between dependent-like behaviors of 
clients in counseling and the style of the 
counselor. Reflective counselors tended to get 
less statements classified as dependent. Pre- 
counseling structuring also seems to influence 
dependent-like talk in clients. Several concep- 
tual and practical implications of these find- 
ings were mentioned 


voluntary 
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ANXIETY—AROUSAL 


RATED CORRELATES OF THE 
INTERPRETIVE 


EDMUND 5 


Psychiatric Institute, University 


Since most theories of psychoanalytic psy- 
chotherapy assume interpretation to be a cru- 
cial therapist operation promoting therapeu- 
tic progress, many investigators, during the 
last decade, have naturally focused their at- 
tention upon therapist interpretive behavior. 
They have been concerned to show that in- 
terpretive behavior is quantifiable along a 
unitary dimension of Depth, and predictive 
of certain alterations in the patient’s behav- 
ior, as a function of the Depth variable. 

Following Collier's (1953) earlier work, a 
notable step toward these goals has been made 
in a systematic program of studies performed 
in the University of Michigan laboratories 
(e.g.. Bordin, Cutler, Dittmann, Harway, 
Raush, & Rigler, 1954; Cutler, Bordin, Wil- 
liams, & Rigler, 1958; Harway, Dittmann, 
Raush, Bordin, & Rigler, 1955; Raush, 
Sperber, Rigler, Williams, Harway, Bordin 
Dittmann, & Hays, 1956; Speisman, 1959). 
Several of these studies indicate, among other 
things, that psychotherapists can rate inter- 
pretive statements with moderate reliability 
using an empirically founded scale of Depth 
of Interpretation (derived basically from 
Fenichel’s, 1941, definition of Depth). Fur- 
thermore, in four experiments designed to ex- 


amine the dimensionality occurring in judg- 


ments of a number of interpretive “exchanges” 
Raush et al. (1956) found Depth of Inter- 


1 This paper arises out of a research program (of 
which E. S. H. is the principal investigate sup- 
ported by research grant No. M-3355 from the Na 
Instit Mental Health of the National 
Institutes of Health, United States Public Health 
Jean M. C. O'Connor kindly gave several 
ours of her time in preliminary discussion and made 
many which the 
has benefited from 
by 


esented to 


tional ute 
Service 
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grateful. The manuscript 
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annual convention 


Association, New 
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York, New 


American 
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of Maryland School of Medicine 


pretation consistently to be the primary one 
of three dimensions; while the secondary di- 
mensions (one of which appeared in their 
Study 2 to be either “Emotionality” or “Am- 
biguity”) varied according to the particular 
raters and interpretive stimuli employed. The 
authors (Raush et al., 1956) concluded that: 


although the method of analysis is sufficient] 
developed to allow for the quantification of vari 
ance, the logic underlying it permits us to state with 
confidence that in our data can be 
attributed to the depth of interpretation dimension 
than 


47). 


not 


more variability 


to any of the other dimensions operating (p 


therefore, to examine this 
significant area of consensus between judges, 
in an attempt to illuminate 
basis upon which it rests 

This question has, of 
to the problem of improving the 
reliability of ratings of the Depth variable. 
Decreased application of the 
Depth of Interpretation scale will occur if, 
in addition to the cues of Depth implied by 
the scale itself, there are also present other 
tvpes of cues mot implied by the scale but un- 
wittingly both assumed by raters to be cri- 
teria of Depth and actually utilized by them 
to degrees perhaps varying considerably from 
one rater to the next. It is 
some importance to examine the question of 
what inherent attributes of 
havior are demonstr 


It seems desirable 
the connotative 


course, considerable 
relevance 


reliability in 


consequently ol 


il terpretive be- 
with the 
ed Depth. If some num 
could established 
ibled: to make more 
ich therapist subjects 


correlated 
variable popularly cal 
of these attribute 


per be 
one 


then should be et 
explicit the cues to 

may (consciously or not) in 
the experimental ratir 


tially to control undesirable sources of error 


be responding 


g situation; hence par- 


and 
spect those objectively 


in experiments requit 
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ing Depth ratings: 
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CORRELATES OF 


havior is different from—or, indeed, perhaps 
similar to—other classes of therapist behavior. 

The experiment to be reported was accord- 
ingly designed to examine the correlations be- 
tween ratings of 62 interpretive responses, 
made along the following “dimensions”: 
Depth of Interpretation, Potential for Anx- 
iety-Arousal, Specificity/Focus, Therapeutic 
Skillfulness, and Implausibiltiy. The hypothe- 
sis entertained was that responses rated deep 
(vis-a-vis shallow) would be rated as strongly 
anxiety-arousing, as highly specific and fo- 
cused, and as highly implausible (Fisher, 
1956). It was anticipated that the relation 
Depth and Skillfulness would be 
either linear and positive or, alternatively, 
such an inverted U function as would impute 
greater Skillfulness to interpretations of mod- 
erate, rather than extreme depths. 

It should be noted that the 
the Michigan studies 
from those used here 


between 


procedures 
different 
The Michigan investi- 
gations employed bona fide interview mate- 
rial; whereas, it will be the material 
used in this one was entirely fictitious. Fur- 
thermore, in the present study the raters were 
given no information about the patient’s ver- 
bal responses; whereas, in two Michigan stud- 
ies (Cutler et al., 1958; Harway et al., 1955) 
a “Context” variable was rigorously con- 
trolled. In the experiment by Harway et al. 
however, Depth ratings made under ‘“Thera- 
pist Only” conditions yielded negligible cor- 
relations with ratings made under “Preceding 
Interview” conditions only if the rater was 
totally without information (from whatever 
source) about the areas in which the patient 
was (or had been) talking. This potential 
deficit is partly offset in the present study by 
the considerations that detailed case history 
and psychological test information about the 
patient were presented to the subject at the 
outset, and that the 


used in are 


seen, 


essential referents of 


most interpretive statements were implied by 


the statements per se. It should be noted 
nevertheless, that the procedure used here will 
perhaps incline the judges to rate more on 
the basis of stereotyped personality charac- 
teristics, rather than on the basis of moment 
to-moment variations in awareness—a dif- 
ference which may tend to increase inter 
rater reliabilities. 
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Before proceeding to a description of the 
method, a word of explanation should be 
given as to why the four non-Depth variates 
mentioned were selected for study. Anxiety- 
Arousal ratings were obtained since no in- 
vestigator has yet objectively examined the 
assumption, discursively implied in the clini- 
cal literature on psychotherapy, of a system- 
atic relation between the depth of an inter- 
pretive statement and its potential for arous- 
ing the patient’s anxiety. This would be a 
most important empirical connection to make, 
because of the respective roles of interpretive 
behavior in anxiety-arousal and of anxiety- 
arousal in therapeutic change, and because of 
the distinct status of Anxiety in personality 
theory and research. Ratings of Specificity 
Focus and of Therapeutic Skillfulness were 
solicited because of Howe and Pope’s (1961a) 
findings; while the Implausibility variable was 
incorporated in part to replicate, and in part 
to extend Fisher’s (1956) * finding. 


METHOD 


Subjects. The raters 
3oard-eligible psychiatrists 
lowing institutions: Psychiatric Institute, University 
of Maryland School of Medicine, National Insti 
tute of Mental Health, Walter Reed Army Hospital, 
Walter Reed Army Institute of Research, Chestnut 
Lodge, Springfield State Hospital, Spring Grove 
State Hospital, and North Carolina Memorial Hos 
pital. Slightly half of the 
partly psychoanalytically trained 


mailed, after v« 


were either Board-certified or 


affiliated with the fol 


over subjects were at 
The mate 


rbal agree 


least 
rials described below were 
ment by telephone, to 66 subjects, each of whom 
was assigned to one of five experimental groups. Of 
the 50 sets of materials 


leaving a net total of 48 


returned 2 were eliminated 
(One subject 
rejected had rated all statements 
Categories 1, 4, or 7. The other subject 
most of which p 05 
with all other subjects rating for Depth. It 
sumed that he had 
for end). 

Interpretive 
statements 


sets of data 


whose data were 
only in 
showed negative r’s—for 
was as 
unwittingly the 


reversed scale 


en 
statements. A set of 62 interpretive 
were individually rated by all subjects 
along one of five, annotated seven-point scales. The 
statements had been constructed in appropriate re- 


2 The author wishes to thank Seymour Fisher for 
unpublished copies of the materials and results per- 
taining to his own study, and to acknowledge his 
article as the primary source of several major aspects 
of the technique used here. 

3 Thanks are acknowledged to Hans H. Strupp for 
obtaining the cooperation of five 
Chapel Hill, North Carolina. 


psychiatrists at 
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lation to a 1200-word fictitious case history and psy- 
chological test report (assembled by the author *) 
of a 27-year-old female neurotic patient. An explicit 
effort was made to have the statements appear highly 
credible in relation to the background presented on 
the patient, in order to maximize real correlations 
and to minimize spurious correlations between Skill- 
fulness ratings and ratings along the other four di 
mensions referred to earlier 
ment was typed on a 3 

being provided 


Each interpretive state 
5-inch card, the subject 
seven “marker” cards (num 
bered 1-7) to facilitate the rating procedure. He was 
permitted to re-examine and to revise his ratings as 
much as he might wish before returning them to the 
experimenter. 
Instructions to subjects 


with 


Each subject received the 
same set of 62 interpretive statements, the case his- 
tory and psychological test summary of the patient, 
and a consistent orientation toward the task.5 The 
subject was informed that the 62 statements: (a) 
had been made to the patient by the same therapist 
in interviews ranging up to the 232nd hour; (b) re 
flected the therapist’s personal belief or opinion; and 
(c) had not been uttered either in whole or in part 
by the patient immediately before being verbalized 
by the therapist. While the statements were self 
evidently of an interpretive nature the instructions 
referred to them as “interpretive” only in the group 
of subjects rating for Depth. For all other subjects 
the statements were referred to as simply “therapist 
statements.” 

Instructions to the subjects differed mainly in re 
gard to the dimension along which the statements 
were to be rated. Group 1 rated for “Depth—in the 
usual sense of the term” (Fisher, 1956, p. 250). A 
rating of 1 implied a most shallow interpretation; 
one of 7 the deepest form of interpretation. The sub 
jects in the Depth group were informed that the 
statements had actually been made between the Ist 
and 232nd hours, but they 
“on the assumption that all interpretations were 
made during the first 3 or 4 hours of therapy.’ 
Adoption of Fisher’s lead in this regard was decided 
upon, although in retrospect this is not 
to be a crucial issue 


were requested to rate 


considered 
(As a matter of fact several 


subjects in the Depth group strenuously objected to 


this request on the grounds that it made the task 
seem more unreal than it was already!) 


‘Benjamin Pope made several helpful comments 
during construction of the psychological test report 

5 Copies of the 62 responses, the case history and 
psychological test report, the five sets of instructions 
given the raters, the five sets of statement means, 
and the six scatter plots illustrating the linear rela- 
tions among the four intercorrelated variables, have 
been deposited with the American Documentation 
Institute. Order Document No. 7048 from ADI Aux- 
iliary Publications Project, Photoduplication Service, 
Library of Congress; Washington 25, D. C., remit 
ting in advance $2.25 for microfilm or $5.00 for 
photocopies. Make checks payable to: Chief, Photo- 
duplication Service, Library of Congress 
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rABLE 1 


DESIGN OF EXPERIMENT 


Group 2 rated along a “dimension of Generality- 
Specificity—in other words, in terms of the degree 
of sharpness or focus.” The limits of the annotated 

1 “Highly general and unfocussed: the 
statement is about as general and unfocussed as a 
therapist statement can | ind 7 “Highly 
and focussed: the statement is about 
focussed as a therapist stater 
be noted that in Howe & 
bipolar General-Specifi 
cussed were quite 
Potency 


scale were: 


specific 
as specific and 
vent can be.” (It should 
Pope’s, 1961a, paper the 
Unfocussed-Fo 
highly id on the 
dimension. TI 
seemed subjectively to the present experimenter to 
constitute the least ambig is for the subject’s 
ratings in this portion of tl udy.) It was asserted 
that the statements had been made between the Ist 
ind 232nd hours. 
Group 3 rated th¢ 


scales 
Precision 


scales furthermore 


along a “dimension 
of their potential for Arousal in the pa- 
tient.” A rating of 1 implied “very slight or negli 
gible Anxiety-Arousal” ; rating of 7 
iety-Arousal.” In order lardize the subject’s 
frame of reference the tatements were asserted to 
have been made between the 190th and 232nd hours 

Group 4 rated ctions and relationships 
implied by these statements on a dimension of Thera 
peutic Skillfulness. By a therapeutically skillful re 
sponse is meant one whi iltimately likely to be 
most helpful to the patient.” It was further re- 
quested that the subject “try to ignore any disagree- 
ment you may have with the exact way (i.e., ‘form’) 
in which the therapist has expressed such connections 
and relationships.” ® A rating of 1 implied a state 
ment to be skillful, ultimately of 
little or no help to the patient”; one of 


“strong Anx- 


“the conn 


lichtl 
very i 


“only 


7 “most 


6 An attempt was made impose this 
in order to reduce undesirable « 
ing from the tendency of me 


restriction 
deriv- 
psychotherapists to 
professionally more 
sophisticated when they are stated in a tentative and 
mollified fashion; but there is no objective way in 
which the success of this attempt can be gauged. A 
separate experiment has beer 


rror variance 


regard interpretive respons¢ 


performed in the au- 
thor’s laboratory systematically to examine the in 
fluence of various intensive and probabilistic ad- 
verbial modifications upon ratings of Depth of Inter 
pretation. 
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skillful, ultimately of considerable help to the pa- 
tient.” The statements were asserted to have been 
made between the 190th and 232nd hours of therapy. 

Group 5 rated the statements along “a dimension 
of Plausibility-Implausibility from the patient’s point 
of view.” A rating of 1 implied the patient’s entire 
agreement, and one of 7 his entire disagreement with 
a therapist statement. The statements were asserted 
to have occurred between the 190th and 232nd hours. 

Differential instructions to the five groups of sub- 
jects are summarized in Table 1. All further refer- 
ence to these groups will be made via the labels ap- 
pearing in the second column of the table. 


RESULTS 


Table 2 presents grand means, standard 
deviations and ranges of the five sets of state- 
ment means, Since the interpretive statements 
were entirely fictitious concern was felt about 
the degree to which they would actually be 
endorsed from the standpoint of Skillfulness, 
and about the concomitant effects of low en- 
dorsibility upon ratings by the other four 
groups of subjects. The data indicate that 
while the variance of statement means along 
the Skillfulness dimension was relatively small 
(see Footnote c, Table 2) nevertheless the 
statements as a group appear to have been 
considered at least moderately skillful (M 

4.16: o =.73). It should be noted also 
that somewhat finer discrimination among the 
62 statement means was obtained in the 
Depth and Implausibility groups compared 
with the Anxiety-Arousal and Specificity 
groups (p 05). 

Analysis of variance of data from each 


rABLE 2 


MEAN, STANDARD DEVIATION, AND RANGE 0! 


THE 62 STATEMENT MEANS, BY GROUPS 


Group* vu» Range 


Depth 3.94 1.30 
Specificity 4.87 ‘ 1.67 
Anxiety-Arousal 3.50 1.13-5.3 
Skillfulness 4.16 2.00-5.58 
Implausibility 3.38 1.11 


* For values of N see Table 1. 

b Identical with grand mean for each group. With the ex- 
ception of those between Anxiety-Arousal and Implausibility, 
and between Skillfulness and Depth, every pair of mean values 
yields a significant value of t (df =60, p<.05, two-tailed test). 

© The variances of statement means in the Depth, Anxiety- 
Arousal, and Implausibility groups consistently exceed the 
variances in both the Specificity (9<.05) and Skillfulness (p< 
.001) groups. The variance of statement means in the Im- 
plausibility group is larger than that in the Anxiety-Arou 
group (p <.05) 


TABLE 3 


RELIABILITY ESTIMATES OF RATING 
THE 62 STATEMENTS, BY GROUPS 


group showed significant values of F in all 
cases for both between-statement (p < .001) 
and between-rater (p < .001) variance esti- 
mates. Table 3 presents estimates of reliabil- 
ity of rating under the five conditions. While 
there is considerable variation among within- 
group interrater reliabilities, the reliability of 
average ratings (Guilford, 1956) is either 
moderate or high for four of the groups. 
Depth emerges as the most reliably rated 
dimension, that of Implausibility running a 
close second. Reliability indices for Anxiety- 
Arousal and Specificity groups, while clearly 
significant, are comparatively lower. The soli- 
tary discrepancy involves the Skillfulness 
group, for which the mean interrater 7 of .11 
is not significant at even a conventional level.’ 
This finding makes clear that at least under 
the conditions of this study differential rat- 
ings of Skillfulness were extremely difficult to 
make. It should be noted, however, that this 
difficulty may in part have arisen from an 
unwitting selection of “good” statements, by 
the experimenter, rather than from an in- 
herent inability of the subject to rate along 
the Skillfulness dimension, 

The major interest of the study lies in the 
intercorrelation matrix for the five sets of 
statement means. This is presented in Table 4. 
It is observed that statement mean ratings of 

7 The ratings of a single subject account for 7 of 11 
observed negative values among interrater r’s within 
the Skillfulness group. Even were this subject re 
moved from the matrix, however, the mean r would 


be increased from .12 to only .14. The low 
value of r in this group is thu 


mean 
not appreciably duc 
to the presence of one deviant subject 
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TABLE 4 


MatTRIX OF INTERCORRELATIONS BETWEEN MEAN 
RATINGS OF STATEMENTS RATED ALONG THE FIVE 
DIMENSIONS 


Speci- 
heity 


Anxiety- Implaus- 


Skill 
Arousal bility f 


Group fulness 


Depth e Bag 75 —.07 
Specificity 59* 45 OS 
Anxiety-Arousal 51 238 
Implausibility —.21" 


* Although neither r is statistically significant the two differ 
p <.02, two-tailed test) by comparison of zs transformations 
(McNemar, 1955, p. 148). 


* With df =60, p <001 (two-tailed test). 


Depth, Specificity, Anxiety-Arousal and of 
Implausibility show moderate to high in- 
tercorrelation (values of r ranging from .45 
to .75; p < .0O1), leaving no doubt that 
through whatever mediating mechanism the 
subjects in all four groups were in large part 
rating along the same dimension. 

Skillfulness ratings on the other hand show 
only chance departure from zero correlation 
with the other four sets of ratings (Table 4). 
Plots were made of Skillfulness means against 
each other set of means, in case significant 
curvilinear relations seemed evident. They 
clearly did not. It may be marginally noted 
that Skillfulness ratings show an r of .23 
(p = .10) with Anxiety-Arousal, and one of 
- .21 (p= .10) with Implausibility ratings. 
While no definitive interpretation can be 
made of these observations they leave one 
with a suspicion that under some conditions 
anxiety-arousal might be considered skillful 
and implausibility unskillful.* 


DISCUSSION 
Correlates of Depth 


The results leave no doubt whatsoever that 
the concept of Depth of Interpretation—in 
whatever “usual sense of the term” this was 
understood by the subjects in the Depth 
group—is assumed by psychotherapists to 
have other measurable correlates than Im- 


8In a factor analytic study (Howe, 1962) of the 
dimensionality of ratings made on 10 of the 62 in- 
terpretive responses (equally spaced over the Depth 
dimension), the bipolar scale Plausible-Implausible 
emerges on Factor 2—Professional Evaluation. Anx- 
iety-Arousal/Anxiety-Reduction emerges on Factor 1 
—Depth/Potency. 


plausibility: a deep interpretive statement is 
rated not only as relatively implausible, but 
also as quite specific and quite anxiety-arous- 
ing. 

That ratings of Specificity and of Depth are 
correlated perhaps reflects an intrinsic prop- 
erty of language in and of verbal 
between therapist and patient in 
particular. In this regard the author (Howe 
& Pope, 1961b) has raised the 
question of the dimensional relations between 
elicitation of information and interpretation 
of information. That the Specificity ratings 
of interpretive statements are correlated with 
Depth ratings may indeed lend a hope that, 
operationally speaking, Exploration and In- 
terpretation are not essentially oblique dimen- 
sions, but rather, a unitary dimension.’ This 
important possibility would require rigorous 
study. 


general, 
messages 


elsew here 


It is of interest to consider the implica- 
tions of the fact that more specific messages 
from the therapist (which carry more infor- 
mation) are also assumed mere implausible 
(r = .45) to the patient. It is even intrigu- 
ing to turn this equation round and to in- 
quire about the meaning of Plausibility as 
attributed to less specific—i.e., more general 
—messages. One may suspect that we are in- 
volved here with some form of abstraction 
ladder: compared with deep ones shallow 


interpretations, psychologically speaking, are 


more nearly “universals,” more accessible to 
validation, and harder to disprove. Deep in- 
terpretations, conversely, are not universals 
(except, perhaps, in a restricted psychoana- 
lytic sense), are less accessible to validation, 
and are most certainly harder to prove. Were 
the connotations of Plausibility regarded in 
this light then a serious question would evi- 
dently arise about what Fisher (1956) calls 
the “oversimplified notion” (p. 254) that 
some psychotherapists may select interpretive 
statements for verbalization by using likely 
plausibility to the patient as an inverse cri- 
terion of Depth. For according to the preced- 


ing analysis a plausible statement implies 


® Presently available data from the author’s labo- 
ratory indicate that first and second factors derived 
from ratings of exploratory (e.g., initial interview) 
behavior are highly similar in structure to those 
mentioned in Footnote 8 
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such a degree of generality as to make refuta- 
tion or disproof of it as difficult as the state- 
ment is plausible! 

It seems clear, however, that one can fur- 
ther look at the problem of gauging the depth 
of interpretive statements from the standpoint 
of their potential for arousing the patient’s 
anxiety. This point of view also makes Depth 
fairly and squarely a function of the indi- 
vidual patient at a particular instant, and 
furthermore enables avoid (and in 
part obviates the necessity for) questions as 
to whether a given interpretation, as such, is 
exactly “true” or not. Indeed, the therapist is 
frequently as much (if not more) interested 
in the patient’s response per se to an inter- 
pretation as he is in the net value of the in- 
terpretation for the patient. This viewpoint 
has the further advantage of linking a funda- 
mental operation in psychotherapy to a funda- 
mental variable of general psychological inter- 
est and attention, in the behavior of the pa- 
tient. It remains to be seen, of course, whether 
deeper interpretive responses can in fact be 
shown actually to arouse more anxiety in the 
patient than do shallower ones.. It is quite 
likely that this hypothesis is entirely consist- 
ent with Speisman’s (1959) recent finding. 
In his study Speisman demonstrated that in- 
terpretations of moderate Depth (arousing a 
moderate degree of anxiety?) were more ef- 
fective than either shallow or deep interpreta- 
tions (low or high anxiety-arousing state- 
ments?) both in reducing resistance and in 
maintaining lower levels of resistance. 

The observed correlation between Anxiety- 
Arousal and Implausibility has a further im- 
plication. It is frequently assumed by psycho- 
therapists that an interpretation which the 
patient cannot comprehend, or with which he 
actively disagrees, is more likely to be too 
discomforting than it is to be incorrect. The 
correlation referred to is essentially in line 
with that assumption. It should be noted, 
however, that evidence is thus offered for the 
prevalence, and not for the validity of the as- 
sumption. 

The data generally suggest, as Fisher 
(1956) and Cutler et al. (1958) were well 
aware, that Depth ratings evoke a judgmental 
process involving a variety of cues which have 
never been systematically isolated, to which 


one to 
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different subjects respond in different ways. 
But the difficulty here cannot inevitably be 
overcome by a set of verbal instructions to 
the subject. Cues have a nasty habit of inter- 
acting with each other such that the role of 
any one cue is likely to be a function of its 
rank position in a hierarchy, and of the pres- 
ence or absence of other cues. As Howe and 
Pope (196la) suggested, “regardless of what 
instruction one gives to a rater, he 
the final 


will, in 
inalysis, rate according to certain 
internal mediating cues which only partly 
correspond with whatever explicit cues the 
experimenter is trying to communicate” (p. 
301). Specificity and Potential for Anx- 
iety-Arousal, like Implausibility, involve cues 
which must arouse in the subject a mediat- 
ing response partly similar to that aroused 
by cues of Depth. These several cues may be 
regarded as connotative meanings (Osgood, 
1952) of the term Depth; but this does not 
preclude the possibility that the subjects will 
disagree about the denotative 
term Depth—which they do. 


meaning of the 


A Measurement Problem 


The correlation of .75 obtained here be- 
tween Depth and Implausibility ratings is 
significantly lower (> < than that of 
.87 obtained by Fisher. The difference, objec- 
tively considered, is in no sense serious; 


U5) 


but 
it is instructive to consider in a little detail 
the rather fluid ingredients upon which these 
r’s are based. 

While 


ments 


comparison of interpretive state- 
used in the two studies should be 
made with due caution, it is the experi- 
menter’s firm impression that Fisher’s set of 
60 contained more statements of relatively 
extreme Depths than did the set used in the 
present study (e.g., whereas 12 of Fisher’s 
statements were given mean Depth ratings 
between 1.0 and 1.9, only 1 of the author’s 
statements was so rated). This type of dif- 
ference will not only lead to different em- 


pirical outcomes; in addition (and more im- 


portant) it obviously may lead to spuriously 
high (or low) correlations between Depth and 
other dimensions. This problem will be more 
complex, however, should one be specifically 
interested in the relation between Depth and 


Anxiety-Arousal. The relationship reported 
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here is statistically linear; but it will not turn 
out to be so under all conditions, since the 
two deepest (and incidentally among the least 
skillful) statements have only moderate Po- 
tential for Anxiety-Arousal. When these two 
statements are removed from the set of 62 
the r between Depth and Anxiety-Arousal in- 
creases (as it must) from .59 to .68 (p > .05), 
while that between Depth and Implausibility 
decreases from .75 to .71 (p > .05). The ques- 
tion of what relation exists between ratings of 
Depth and of Potential for Anxiety-Arousal 
is thus further tied with the issue of the 
range of stimuli employed in the study. This 
matter clearly commands attention in order 
that genuine error of measurement shall not 
be obscured from view. 


SUMMARY 


Forty-eight psychiatrists and psychoana- 
lysts assigned to one of five groups rated a 
set of 62 interpretive responses along one of 
five annotated seven-point scales: Depth of 
Interpretation; Specificity; Potential for Anx- 
iety-Arousal; Therapeutic Skillfulness; and 
Implausibility. The interpretations were 
judged in relation to a 1200-word case his- 
tory and psychological test summary of a 
27-year-old neurotic female patient. 

Mean ratings of the interpretive responses 
were found to be significantly intercorrelated 
(r’s from .45 to .75) among the Depth, 
Specificity, Anxiety-Arousal, and Implausibil- 
ity groups, suggesting that the four sets of 
instructions elicited a common mediating re- 
sponse. In general, deep interpretations were 
rated as being more specific, more anxiety- 
arousing, and (as Fisher had earlier demon- 
strated) more implausible. Ratings of Thera- 
peutic Skillfulness showed only chance values 
of r with ratings of the other four variables. 
The four intercorrelated variables showed sig- 
nificant interrater and average-rating reli- 
abilities. Ratings of Therapeutic Skillfulness, 
however, did not. 


Epmunp S. Howe 


Some implications of the finding that Depth 
has correlates other than Implausibility are 
labeled: things it is suggested 
that it may be methodologically profitable to 
regard Depth of Interpretation from the 
standpoint of Potential for Anxiety-Arousal 


among other 
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Increasingly research on effects of treatment 
procedures such as the tranquilizer drugs, psy- 
chotherapy, and electroshock utilizes quanti- 
fied indices of patient symptoms as criteria 
of change or improvement. In fact syndrome 
description often constitutes the only direct 
knowledge of a disorder and its severity in 
the absence of known etiology. Knowledge of 
syndromes also has considerable relevance t 
the practical problems of decision making 

ich as the kind of treatment to administer 
ward com- 


assignment, and eligibility for 
Accurate knowledge of syndromes 


pe nsation 
if available, could make possible develop 
ment of more satisfactory patient classifica- 
tion schemes than are in current use. Thus 
both for practical ind research reasons it is 
important to establish what major syndromes 
actually exist among psychiatric patients, and 
which symptoms define each syndrome 

The purpose of the present study was to 
secure further evidence in support of 10 pos- 
tulated psychiatric symptom syndromes ob- 
Many but not all 


one or 


interview 
identified times in 
factor analyses reported by Wittenborn (Wit- 
& Holzberg, 1951), Degan (1952), 
Lorr (Lorr, Jenkins, & O’Connor, 1955), and 
Guertin (1952). The syndromes hypothesized 


servable in an 


have been more 


tenborn 


may be labeled, for purposes of identification, 

Excitement, Hostile Belligerence, 
Projection, Perceptual Distortion 
Motor Disturbances 
zare positions and movements), Grandiose Ex- 
pansiveness, Thinking Disorganization, Dis- 
orientation, Withdrawal and Retardation, and 
Agitated Depression 

The Hostile Belligerence syndrome has not 
been demonstrated before from interview data 


as follows 
Paranoid 


(hallucinations), (biz- 


although it is well represented in the cli- 
nician’s concept of a paranoid. Likewise With- 


drawal with Retardation has been identified 
the Vete il B tl 
Washington, D. ( 
ton Ho | it i] Pe 


only in ward data. Disorientation, as a syn- 
drome, has been found only by Degan (1952) 


in Moore’s data. 

METHOD 
h 
1 
which consisted of 42 nine 


The data collecting device was t 
dimensional Psychiatric Scale (I) 
Form I, 


rating s¢ 


e | paticr Multi 
PS) Experimental 
point unipola 
ules and 48 check list items rated 
ibsent 

Each postulated 
rating schedule by te 
in random order. IMPS 
terviewer following i 
interview. 

Ratings were collect 
of 47 hospitals. A 
and a silent observ 
tient. A total of 
each being re} 
patterns: anxiot d é i, n ced delusiona! 
hostile-belligeren ranoid, apathetic 
and unmotivated, markedly | cinated, disorga 
ized in thought ocesses, retarded and depresse 
and manic-excit 
representative 
fessional per 
pose of thi 
assure the pres fa VI 

Each of the nine-point scak 
the mediar d the individual 
dic hotor 
obtain 
analys 
ing pri 
simple 
zero 
columns 
disori 1 combined into a 


score re s germane ! nd thu 


Nine 


not | d to d : parat olumn 


able e | unassigned ister. By ap 


catio i ft weight matri i! actor vere 
tractes 

indicat 

extra 

subjec ed 

the object 


SULTS 
The criterion set for confirmatior f the 


postulated syndromes was that each symptom 
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Lorre, 


TABLE 1 


Factor A: EXxcIreMENT 


Scale variable 


Unrestrained in s! 

Speech hurried « 

Mood elevated; euphori 

Exhibits attitude of superiority 
Dramatizes self and seeks attention 
Speech loud, boisterous or intense 
Body movements hurried 
Overtalkative 

Dominant or controlling 


have its only significant factor projection on 
the syndrome to which it was allocated. The 
point of significance was arbitrarily set at 
.30. Examination of the oblique factor matrix 
after rotation revealed that only in one in- 
stance did the highest factor loading for a 
scale variable fall into a column to which it 
was not allocated. The symptom “attitude of 
superiority” has its highest projection on A 


McNarr, KLETT, AND LASKY 


rABLE 2 


Factor C: PARANOID PROJECTION 


rather than H as hy 
ideas has a secondary 


pothesized. Recurring 

projection (.30) on 
Perceptual Distortion. Five of the symptoms 
defining Thinking Disorganization, although 
located as predicted, emerge with factor pro- 
jections less than .30. The major reason seems 
to be that few inaccessible incoherent patients 
were represented in the sample. Hence the 
definition of the syndrome is weak. 

Thus with respect to the criterion set here, 
eight of the syndromes are confirmed almost 


TABLE 3 


Factor H 


Number 


Expre sses feeling of host 


Exhibits attitude of contempt 


HostTILt 


SELLIGERENCI 


Scale variable 


lity towards others 


or disdain 


Manifests hostile or sullen attitude 


Shows irritability or annoyance 


Blames others for difficulties 


Expresses feelings of bitterness and resentr 


nent 


Complains, gripes, and finds fault generally 


TABLE 4 


Factor E 


AGITATED DEPRESSION 
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TABLE 5 


Factor I: WITHDRAWAL WITH RETARDATION 


Factor 


Scale variable loading 


wed or del rate 53 
is l 52 

36 

48 

37 

5s? 


48 


completely. Thinking Disorganization, how- 
ever, is weakly represented. The disorienta- 
tion syndrome will be discussed later. 

In describing each factor the letter used for 
identification is the same as applied in ear- 
lier studies. The simple descriptive labels 
given the syndromes are designed to avoid 
confusion with psychiatric diagnoses which 
are patient groupings rather than symptom 
syndromes. Factor A in Table 1 represents an 
Excitement Syndrome very much like a clas- 
sical Manic-Excitement. Degan (1952), Wit- 
tenborn (Wittenborn & Holzberg, 1951), and 
Lorr, Jenkins, and O’Connor (1955) have iso- 
lated a very similar symptom cluster. 

Factor C defines a syndrome of morbid sus- 
picion and attribution of blame to others that 
is called Paranoid Projection. The cluster pre 
sented in Table 2 represents one of the best 
established syndromes. Wittenborn and Lorr 
et al. have isolated this factor repeatedly. 
Degan failed to do so because no defining 
variables of suspicion were included in the 
data analyzed. 

A syndrome positively related to but inde- 
pendent of both Excitement and Paranoid 
Projection is Hostile Belligerence, Factor H. 
(See Table 3.) A very similar pattern has 
been found in ward data by Degan (1952), 
Lorr et al. (1955), Guertin (1952), 
Smith, Lasky, Hover, 


and 


Bostian, and Ging 


TABLE 6 


TOR F: PERCEPTUAL DISTORTIONS 


Factor 


Scale vari loading 


Speaks to voices 32 

Reports hearing voic 63 

Reports seeing visions 39 

eports other hallucinations 40 

s voices accuse or blame him 67 

*ports voices tell him to do t zs 53 
*ports voices threater 


(1959). The hostile acting out is as much a 
syndrome as Excitement and agitation. Moore 
(1933) is one of the few who recognize this 
syndrome. 

Factor E, Agitated Depression, describes 
a morbid intropunitiveness. Wittenborn and 
Lorr et al. have repeatedly reported this syn- 
drome. The correlations of the syndrome are 
typically zero or slightly negative with all 
other syndromes except Withdrawal and Re- 
tardation. In second order analyses also, Fac- 
tor E is unique. 

A related factor is I, labeled here With- 
drawal with Retardation. This syndrome or 
one very similar has been isolated only by 
Guertin (1952) in interview data. In previous 


factor analyses similar factors have appeared 
in ward data only. Inspection of the variable 


reveals two elements: psychomotor retarda- 
tion and apathy. Within a mixed sample of 
psychotics this can be expected to occur. 
However, within a group of patients diag- 
nosed retarded-depressed Hamilton (1960) 
has shown that only psychomotor retardation 
is exhibited. 

The syndrome we call Perceptual Distor- 
Factor F, has identified in 
Moore’s data. Lorr et al. have isolated this 
cluster in every analysis reported. The lack of 
sufficient reference variable accounts for its 
absence in the Wittenborn studies. A related 


tion, or been 


TABLE 


Rae 
stinct 


or extol him 
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but narrow syndrome is Grandiose Expansive- 
ness. While this factor is limited in impor- 
tance it does appear in nearly every study. 
Wittenborn calls a similar factor Paranoid 
Condition. 

The Motor Disturbances syndrome, Factor 
G, represents aspects often labeled catatonic 
although the hebephrenic reaction is also said 
to be defined by this cluster of symptoms. 
Factor K, or Conceptual Disorganization, has 
the broadest relations of any of the syn- 
dromes. In the second order analysis it has 
positive loadings on all three factors. It is 
negatively related only to Agitated Depres- 
sion. Conceptual Disorganization is thus cen- 
tral to nearly all of the syndromes called 
schizophrenic and is even associated with Ex- 
citement. 

As indicated earlier the six disorientation 
items were summed and represented by one 
score. However, it is clear from its systematic 
low correlations across all variables and its 
low communality that this group of variables 
defines an independent syndrome which is 
called Disorientation. 


KLETT, AND LASKY 


rABLE 8 


Factor G: Mot DISTURBANCES 


Factor 
loading 


Grins in a meaningles (.21) 
Makes unusual facial gr Ace 58 
Assumes bizarre postures 50 
Exhibits peculiar re 60 


Second Order F 


ictors 


In order to isolate and interpret any second 
order factors, the correlations among the pri- 


mary syndromes were computed. Four factors 
appeared to account for all of the variance. 
However only the orthogonal factors that 
were interpretable are presented in Table 11. 
Factor X clearly represents a bipolar Excite- 
ment versus Retarded Withdrawal. Previous 
analyses left the question of its bipolarity in 
doubt. It should be that incoherence 
and thinking disorganization as well as hos- 
tility define the positive pole. Motor Disturb- 
ances also has a small loading on X. 

Factor Y is tentatively labeled 


noted 


Hostile 


TABLE 9 


Factor K 


Number 


Responses are irrelevant 


nN — — 
Mme NO NM be 


Uses neologisms 


TABLE 


(CORRELATIONS OF 


12 
04 
+] 
15 


36 


Note. have been o 


-Decimal point 


CONCEPTUAI 


DISORGANIZ 


Scale variable 


Speech is logically disconnected or incoherent 
Rambles off the topic discussed 
Deficit in memory for events of last week 


Repeats words and phrases in a meaningless way 


10 


PRIMARY 
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rABLE 11 


Matrix oF SECOND OrpER Factors 


Excitement 

Paranoid Projection 

Agitated Depression 
Perceptual Distortion 

Motor Disturbances 

Hostile Belligerence 
Withdrawal 

Grandlosity 

Conceptional Disorganization 


Paranoia. The syndrome is defined by Para- 
noid Projection, Hostile Belligerence and to 
a lesser degree by Grandiosity and Thinking 
Disorganization. A similar factor has not been 
isolated probably because the hostile element 
has not been included before. A reading of 
the description of schizophrenic reaction, 
paranoid type, in Mental Disorders (Ameri- 
can Psychiatric Association, 1952) indicates 
a Close parallel to the syndrome defined here. 

The third factor Z represents a syndrome 
previously labeled Thinking Disorganization. 
It is defined by Perceptual Distortion, Gran- 
diosity, and Conceptual Disorganization. 
Degan isolated a similar syndrome he called 
Hebephrenic Schizophrenia. However, the syn- 
drome is a poor match for schizophrenic re- 
action, hebephrenic type as defined in Men- 
tal Disorders. 

The second order factors may be regarded 
as broad patterns made up of the first order 
syndromes. As such there is no evidence in 
support of Eysenck’s contention that there is 
a single second order factor of psychoticism. 
There is the possibility, however, that there 
exists a third order factor defined principally 
by disorganization of thinking. 


Ten Psycuori 


SYNDROMES 


SUMMARY 


Ten postulated psychotic syndromes were 
confirmed, in varying degrees, in a broad 
sample of psychotic patients. Three broader 
syndromes, a bipolar Excitement versus With- 
drawal, a Hostile Paranoia, and a Thinking 
Disorganization, were also isolated. Two of 
these have been found previously. To this ex- 
tent use of these syndromes in research, to 
evaluate therapy and to identify biochemical 
-orrelates of psychopathology, is more soundly 
based. 
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CORRELATION BETWEEN RAVEN PROGRESSIVE 
MATRICES AND THE WAIS 


HUGH N. McLEOD anp JOSEPH RUBIN 


Toronto Psychiatric Hospital, Canada 


An analysis of the relationships between 
the Raven matrices (Rev. Ed. 1956) and the 
WAIS (1955) was undertaken at this hos- 
pital for cases where both tests had been 
administered. The population for this study 
consisted of day care, forensic, and inpa- 
tients, most of 


whom were diagnosed as 


neurotic. An examination of the test scores of 
these groups indicated sufficient homogeneity 
to justify analysis as a single group.’ 


tion of .72 between an abbreviated matrices 
(30 items) and Full Scale WAIS, using 82 
cases, and .58 and .70 for the Verbal and 
Performance scales respectively. 

The relationship between the Full Scale 
WAIS and Raven percentile scores was con- 
veniently represented by two visually fitted 
regression lines, y = .56 (Raven percentile) 
+77 for the percentile range 0-45, and y= .32 
(Raven percentile) + 88 for the percentile 


TABLE 1 


AGE AND INTELLIGENCE SCORES OF GROUPS 


Mean age 
Subjects (years) 
Total ; 30.4 
Male 


Female 


79 


a/ 


35.7 


Table 1 describes the ages and test scores 


of the males, females, and the group as a 
whole. 

Table 2 shows the correlations (rho) be- 
tween Raven matrices and WAIS scales for 
males, females, and the group as a whole. 

Our findings were in essential agreement 
with Desai (1955) and Hall (1957). Desai 
reported a correlation of .57 between the 
and the Wechsler-Bellevue Verbal 
scale on 190 cases. Hall reported a correla- 


matrices 


1We are indebted to Jean Brown, William Otto, 
William Lawrence, and William Currie for their aid 
in the collection of data 


WAIS Full Scak 


matrices 


Sp 


30.92 
29.37 


30.50 


Subjects 


Note None of correlat 
* Significant beyond .05 le 
* Significant beyond .O1 k 


*** Significant beyond .001 le 
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PROGRESSIVE MATRICES AND THE WAIS 


range 46-100. These regression lines allow 
for the prediction of the WAIS Full Scale 


; te , ‘ relationship of the Wechsler 
10 IQ points in 73% of the cases. An in- 


, ’ . 2 Bellevue Verbal scale and the Progressive Matrice 
spection of the scattergram relating matrices Test. J. consult. Psychol. 1955. 19, 6 
and Full Scale WAIS scores indicated that 4... Jutta C. Correlation of a modified 
the degree of relatedness between the two oasis Waites  Wislnle 193 
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SOME PERSONALITY DIFFERENCES BETWEEN 
RESPONDERS AND NONRESPONDERS TO 
A SURVEY QUESTIONNAIRE 


BERNARD LUBIN, EUGENE E, LEVITT, anpD MARVIN ZUCKERMAN ?} 


Indiana University Medical Center 


In addition to the loss of data pertinent to 
the content of a questionnaire, the character- 
istics of nonresponders constitutes a poten- 
tially important but rarely known source of 
bias in survey investigations. 

Some information concerning personality 
differences between responders and nonre- 
sponders became available as part of a larger 
investigation on the measurement of depend- 
ency (Zuckerman, Levitt, & Lubin, 1961). 
The Edwards Personal Preference Schedule 
(EPPS) was administered to the entire class 
of sophomore student nurses (NV = 72). Ap- 
proximately one month later, a questionnaire 
requesting opinions on group counseling serv- 
ices was mailed to the group. The instructions 
were to complete the questionnaire and to re- 
turn it in the envelope which was provided. 
Fifty-four completed questionnaires were re- 
turned. 

Responders were significantly higher on n 
Order and on the Dependency Ratio,? and 
nonresponders were significantly higher on 
n Aggression.*® Differences between the two 
groups on some of the other variables do not 
attain significance but are worth noting. Re- 


1Now at Department of Psychology, Brooklyn 
College 

2 The Ratio obtained by 
summing the subject’s scores on n Deference, n Suc- 
corance, and n Abasement, and dividing by the sum 
of the subject’s scores on n Dominance, and n Au- 
tonomy. 

3 Two-tailed ¢ tests. 


Dependency score was 


sponders tend to be higher on n Deference, 
n Succorance, n Achievement, n Nurturance, 
and n Endurance. On the other hand, non- 
responders tend to be higher on n Dominance, 
n Autonomy, and n Intraception.* 

Reuss (1943) reported on the socioeco- 
nomic characteristics of nonresponders and 
Suchman and McCandless (1940) discussed 
the role that the subject’s interest in the 
content of the questionnaire plays in deter- 
mining the completion of mail questionnaires. 
The present findings suggest that personality 
factors are an additional class of variables 
which operate to influence the act of re- 
sponding. 

*A copy of the table comparing the return and no 
return group on the EPPS variables has been de- 
posited with the American Documentation Institute. 
Order Document No. 7045 from ADI Auxiliary Pub- 
lications Project, Photoduplication Service, Library 
of Congress; Washington 25, D. C., remitting in ad- 
vance $1.25 for microfilm or $1.25 for photocopies. 
Make checks payable to: Chief, Photoduplication 
Service, Library of Congress. 
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THE RELATIONSHIP OF SHADING ON THE DRAW-A-PERSON 
TEST TO MANIFEST ANXIETY SCORES 


RAY A. CRADDICK, WILLIAM D 


LEIPOLD, ann PETER D. CACAVAS 


University of North Dakota 


The Draw-a-Person Test (DAP) has not 
borne the scrutiny of research too well (Swen- 
sen, 1957), and little has been accomplished 
experimentally to substantiate the sweeping 
interpretations given to the 
Machover (1949). 

An important consideration of personality 
revolves about the concept of “anxiety.” Ac- 
cording to Machover (1949), “Any degree or 
type of shading is considered an expression of 
anxiety” (p. 98). No studies have been spe- 
cifically designed to investigate Machover’s 
hypothesis. 


drawings by 


This study was an investigation of the hy- 
pothesis of Machover that shading is an in- 
dicator of anxiety. Anxiety was operationally 
defined as a score achieved on the Taylor 
Manifest Anxiety scale (JA scale) (Taylor, 
1953). 

The subjects used were 272 freshman uni- 
versity students (151 females and 121 males) 
from an introductory psychology course. All 
subjects were asked to draw a person accord 
ing to the directions given by Machover. In 
addition, all subjects were given the MA scale, 
these scores representing the external criterion 
of anxiety. Two graduate students were asked 
to rate the drawings for evidence of shading 
(using the Merriam-Webster dictionary defi- 
nition, and comments by Machover regarding 
the presence or absence of this feature in the 
drawings). Two other graduate students, each 
with at least 3 years of clinical practice in- 
cluding experience with the DAP, were asked 
to rate the drawings for evidence of anxiety. 
The latter raters were instructed to use any 
criterion they wished in making their deci- 
sions. All raters were asked to make a forced 
choice on any drawings for which they were 
not certain of their discriminations. 


A 2X2 chi square analysis between the 
raters of shading and the raters of anxiety 
yielded values significant beyond the .01 level 
(df = 1, using Yates correction). In fact the 
percentages of agreement was 80.5 between 
the raters of shading and 65.19 between the 
raters of anxiety. The mean MA scale scores 
for males and females were computed sepa 
rately, and a ¢ test indicated significant dif- 
ferences (p less than .02 with df = 270) be- 
tween the means. Because of the high reli- 
ability between raters, one rater for shading 
and one for clinical anxiety were used to com- 
pare the MA scale scores of the two sexes 
(using the mean MA scale score for each sex 
as the cut-off point) in a 2 X 2 chi square 
analysis. These results, however, did not reach 
a statistically significant level. 

In failing to support Machover’s hypothesis 
that shading indicates anxiety, the study con- 
formed to the majority of findings noted by 
Swensen (1957) in his review on the DAP, 
but does not rule out the utility of the test 
in clinical diagnosis. Since both shading and 
anxiety could be judged with considerable 
agreement between 
raters, the test does consensual 
validation—or at least, reliability. The lack 
of relationship shown with the MA 
mean that the 
“aspects of anxiety.” 


clinically sophisticated 


have some 
scale 


scores may only two tests 


measure different 
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BRIEF REPORTS 


AUTONOMIC NERVOUS SYSTEM MEASURES AND FACTOR 
CORRELATES WITH PERSONALITY INDICES IN 
A TUBERCULOUS POPULATION ' 


EARL D. MARKWELL, Jr 


University of Arkansas Medical Center 


Wenger (1948) has shown that autonomic func- 
tions bear significant relationships to some per- 
sonality measures. 

During a resting state, 75 tuberculous subjects 
were subjected to 19 physiological measures of 
autonomic nervous system function. Four vari- 
ables (dermographia latency—DL, diastolic blood 
pressure—D BP, finger temperature—FT, and sub- 
lingual temperature—ST) and an autonomic fac- 
tor score (A score) were correlated with MMPI 
T score values. 

Means and standard deviations computed from 
MMPI values when compared with those of other 
tuberculous samples available revealed that the 
present sample of subjects was representative of 
the larger hospitalized population from which it 
was drawn and that the psychological makeup of 
the subjects is representative of that for several 
samples of tuberculous patients as a whole. 

Intercorrelational analysis between discrete au- 
tonomic variables and MMPI scores revealed that 
4 of the possible 52 relationships were signifi- 
cant at the .05 level, and one was significant at 
the .01 level. The data also show that 2 of the 
possible 13 relationships between A and MMPI 
scores are significant at the .05 level, and one is 
significant at the .01 level. The relationships 
shown above must be interpreted cautiously since 
they are only slightly more frequent than would 
have been predicted on a chance basis alone 

The largest relationship was the positive one 
between high ST (apparent sympathetic domi- 
nance) and high D scores. The relationship Funk- 


1An extended report of this study may be ob- 
tained without charge from Earl D. Markwell, Jr. 
(Humboldt State College; Arcata, California) or for 
a fee from the American Documentation Institute 
Order Document No. 7049 from ADI Auxiliary Pub- 
lications Project, Photoduplication Service, Library 
of Congress; Washington 25, D. C., remitting in ad- 
vance $1.25 for microfilm or $1.25 for photocopies 
Make checks payable to: Chief, 
Service, Library of Congress 


Photoduplication 


enstein, Kings, and Drolette (1957) infer may be 
helpful in interpreting the data. They report evi- 
dence of a low norepinepherine/epinepherine ratio 
for children, Anger-In or Anxiety, and depressed 
or anxious patients. Relationships between DL 
and FT and Hy and Pa scores are positive. High 
Hy and Pa tendencies toward 
emotional distress of a neurotic variety; whereas 
high F scores indicate 
lated withdrawal of a schizoid type.” 
that schizophrenic patients m 


scores evidence 


tendencies to ~encapsu- 
Evidence 
inifest more sympa- 
thetic dominance than do neurotic subjects seems 
to support the relationship between DL and F 
scores. Since both high Hy and Pa scores are par- 
tially reflections of hostile attitudes and feelings 
the formulations of Funkenstein et al. (1957) can 
again be useful. Anger-Out and paranoid patients 
evidence a high norepinephrine/epinephrine ratio 

High A scores (apparent parasympathetic domi- 
nance) are positively associated with high Pd and 
Pa scores. The Pd-Pa pattern has been found use- 
ful in identifying character disorders and under- 
achieving students. No studies of autonomic nerv- 
ous system functioning of the present kind have 
been reported on psychopathic or character dis- 
order groups. The present data 
correlations are involved, suggest that apparent 
parasympathetic dominance may be associated 
with the presence of emotional malad- 
justment. The present findings would also seem 
to support Wenger’s earlier 


work. He reported 
that A scores were positively associated with de- 


unless spurious 


pression and emotional instability and negatively 


related to objectivity 
01 level 


ind cooperativeness at the 
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THE BARRON 


EGO STRENGTH SCALE 


AND 


PSYCHOTHERAPEUTIC OUTCOMI 


HERBERT GETTER 


Columbus P 


Barron's (1953) original finding was that the 
Ego Strength (Es) correlated positively 
with improvement in psychotherapy, r = ca. 0.45 

The present study was designed to determine 
the relationship between Es, several patient vari 
ables 


scale 


and two indicators of psychotherapeutic 
outcome: improvement; and the number of hours 
the patient remained in therapy. Improvement 
was based on a dichotomous rating by each pa- 
tient’s therapist at the time of termination. The 
patient variables that were included in the study 
and whether they 


ment after being accepted by the clinic 


were sex, age went into treat 


The san 


vchotherapy 


candidates for ps 
at an out-patient psychiatr 
1 


ple consisted of 59 


was divided into three group 
who were rated 
who were rated improved 


treated; 


rhe findings were as fo 
cant relationship was found 


1 The contributions of Alvin 
State University and Willis C 
lumbus Psvchiatri Clini 
edged 

An extended report of this study ay be 
without Herbert Getter 
Department, Ohio State Universit) 
Ohio) or for a fee from the American Documenta 
tion Institute. Order Document No. 7050 from ADI 
Auxiliary Publications Project, Photoduplication Serv- 
; Washington 25, D. C., re- 
mitting in advance $1.25 microfilm or $1.25 for 
Make checks payable to: Chief, Photo- 


luplication Service, Librar Congress 


obtained 
(Psychology 
Columbus 1 


charge from 


ice, Library of Congres 


photocopies 


anp DONALD M. SUNDLAND 


ychiatric Clinic, Ohio 


provement, or Hours in ‘Therapy, or Acceptance 
of Treatment, (b) Es correlates si (= 25 
with Age, and .55 (p< .01) 
generally getting higher scores on the E 

The results of this study 
3arron’s original findings 


vith Sex—male 
scale 
appear to « ontradict 
and reveal that the sex 
of the respondent accounts for an important part 
of the variance of the Es scores. Examination of 
the items of the Ego Strength scale suggests that 
the respondent is admitting or denying various 
psychological That the 
males in our sample are less likely to admit weak- 
nesses seems to be explainable by their cultura 
sex role. In the American culture. it is more a 
ceptable for females to admit inadequacies 
Gottesman found a relation 
tween Es 
His interpretation of this was that Es ref 


ability to 


or physic il weaknesses 


(1959) 


and defensiveness as measure 


socialy aesira 


recognize 


; 


ions of personality.” The sex differer 


present study suggest that high E 
counted for by cultural sex 
no reason to believe that 


discriminate 





ON DIRECTION IN 


HERBERT 
Veterar 


Admini 


In order to establish base rates for figure draw 
ing directionality, the first DAP of 4 


students was inspected for 


80 college 
direction. Less than 
60% of the figures faced foreward and there were 
more left-facing than right-facing figures among 
right-handed subjects. Among left-handed 
jects there were about as many 


sub- 
left-facing as 
right-facing figures. 
These data indicate a handedness difference in 
direction of figure drawings. To investigate such 
a difference, a second study was done to investi- 
gate specifically the relation of handedness to 
drawing directionality. A grou | 375 college 
students was instructed to 
Again right-handed sub 


facing figures than right-facing 


p O- 
more left- 


in 
ts drew 


file 
figures, while left 
handed subjects drew about an equal number 
facing in the two directions. A chi square test ol 
independence testing the hypothesis that di 


tion of drawn f 


rec- 


is independent of handed 


igure 
gave X 17.83 (p< .001) 
handedness difference 


While it 


such a handedness difference 


ness 
These results point to a 


in drawing directionality is possible that 
reflects a differenti 


nfluence of higher centers (as side of brain don 
ance) on drawing directionality. the difference 


in direction may be related to a simple 
sensory-motor variable; viz., whether the 
is held in the left vs. the right hand 

This latter hypothesis was tested 
A group of 


peripher: 


study 22 right-handed sub 
instructed to draw a person in profile 
nstructed to hold the pencil in the 
and half in the left hand. Subjects usir 

An 
tained without 


Hospital ; 


ig tne 
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DRAWING A PERSON ? 


F. CROVITZ 


Durh 


hand drew more left-facing than right-facing pro 
files, while subjects using the 
right-facing than left-facing figures (Fisher exact 
probability = .025). The data support the hy- 
pothesis that drawing directional is related to 
the 


left hand drew more 


ity 
hand used in drawing 

A common report of 
that it 
figure 


right-handed subjects is 
o draw a left-facing 
of such subjects re- 
veals a common movement pattern 

to the body the ult 


“somehow easier” t 


iS 
Informal observation 
The elbow is 
ir surface of the hand 
ontact witl > writing the hand 

point is on an 
rst pencil stroke 
his counterclock- 


in ¢ 


is 


suriace 
is in line with the arm, the penc 
extension of this line nd the f 
is laid down by a wrist fl n. 7 
wise pencil stroke 
left 


forehead of 


avs the 
fi is held in 


igure. When the pencil 


f ng 
bACinike il 


pre S¢ 


left-facing 

An hypothesis 
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THE 


RELATION BETWEEN PARENTAL ATTITUDES TOWARD 


CHILD REARING AND CHILD BEHAVIOR? 


MICHAEL ZUNICH 


Texas Woman’s University 


Comparatively little empirical evidence is found 
in the literature about how parent attitudes to- 
ward child rearing are related to children’s be- 
havior. This study tests the hypothesis that pa- 
rental attitudes toward child rearing are signifi- 
cantly related to selected behavior of children. 

The subjects were 40 children enrolled in the 
preschool laboratories at Texas Woman’s Univer- 
sity. Half were female and half male 
age from 

Direct 


ranging in 
years 8 months to 4 years 9 months 
utilizing a 


observation time-sampling 


technique and predetermined categories was con- 
ducted in the preschool laboratories during regu- 


lar school hours, through a one-way vision mirror. 
The child’s behavior was observed and recorded 
according to categories similar to those developed 
by Merrill (1946) and Bishop (1951). Before 
final data reached 
by discussion between the observers concerning 
types of behavior defined by each of the behav- 
ior categories and their accurate placement within 


collection, an 


agreement was 


a given time interval. The observer reliability for 
each behavior category wa ilso established. The 
child’s rated and recorded under 
the appropriate category every 5 seconds during 
a 5-minute 


behavior was 


} , - 
observation interval venty-four 
made on ea making 


observations were 


> duration 


parental 


a total of 960 observations of 
In order to obtain me 


isures of 


An extended 
tained without 


report of this study may be ob- 
from Michael Zunich (Texas 
Woman’s University; Denton, Texas) or for a fee 
from the American Documentation Institute. Order 
Document No. 7052 ADI Auxiliary Publica- 
tions Project, Photoduplication Service, Library of 
Washington 25 remitting in ad- 
vance $1.25 for microfilm 5 for 
Make checks Chief 


Service, Library of Congress 


charge 


from 


Congress; 
photoc opi S 


payable to Photoduplication 


tudes toward child rearing, parents were given the 
Parental Attitude Research Instrument (PARI) 
Only subscales pertaining to child rearing were 
used in this study. 

Of the 288 comparisons made by means of 
Spearman rank correlation coefficients computed 
between frequencies in the 18 child behavior 
categories and the 16 attitude subscales, 23 evi- 
denced significant at the .05 
beyond. Many of the significant correlations were 
the same for and paternal attitude 
the attitude subscales “Ap- 
Encouraging Verbalization,’ 

were related to the behav- 


relations level or 
maternal 
For example 
proval of Activity,” ‘ 
and “Equalitarianism 


scores 


ior category “Contact” for both parents 
The lack of statistical relations 
tween child behavior and parental attitudes sug- 


gests several hypotheses, e.g., (a) the 


observed be- 


behavior 
categories were not specifically designed to yield 
measures of phenomena assessed by the PARI 


and/or (5) it is possible that the childs’ 
+ } 


behavior 
during preschool hours may no 
of his behavior at 
were made in demonstration laboratories, the 
child may have behaved more socially accept 
ably than in the home situation. However, if the 
investigator obtained a reasonably close approxi- 
child 

: 


that behavior of middle-cl 


e representative 


home. Since the observations 


mation of behavior, the findings 


suggest 
iss children cannot be 
predicted from an analysis of parental attitudes 


toward child rearing 
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A NOTE ON THE FACTOR STRUCTURE OF THE 
NEUROPSYCHIATRIC CHRONICITY SCALE 


JAMES M. ANKER 


Veterans Administration Hospital, Perry Point, Maryland 


Recently the author (Anker, 1961) reported a 
21-item scale which predicted length of stay for 
recent admissions to neuropsychiatric hospitals. 

The true-false responses to each of the 21 scale 
items for 265 male subjects used in the previous 
study were coded and punched in IBM cards. 
Tetrachoric correlations between the items were 
obtained directly from the computing diagrams 
of Chesire, Saffir, and Thurstone. The intercor- 
relation matrix was factor analyzed by the com- 
plete centroid method on the IBM 650. The fac- 
tor loadings were rotated analytically using the 
orthogonal normalized varimax criterion on the 
IBM 650. The orthogonal rotated loadings were 
then rotated subjectively by the method of two- 
dimensional sections to oblique simple structure 
The oblique axes were those that appeared to 
best describe the factor space. The correlations 
between the oblique primary factors were calcu- 
lated. 

The complete centroid computations produced 
loadings on the 21 items for six factors. While 
the loadings varied between the orthgonal and 
the oblique solutions, the items defining the fac- 
tors were essentially the same. The estimated 
amount of total common variance accounted for 
by each factor and a brief factor description fol- 
low: Factor A (49.4%)—personal inadequacy, 
helpless in a hostile environment; Factor B 


(16.5% )—self-acceptance, sense of personal well- 


being; Factor C (12.4%)—social 
personability; Factor D (8.7%) 
tification 


extroversion, 
masculine iden- 
Factor E 


heterosexual adjustment; 


1An extended report of this study may be ob 
tained without charge from James M. Anker (Co- 
ordinator of Research, Psychology Service, Veterans 
Administration Hospital; Perry Point, Maryland) or 
for a fee from the American Documentation Insti- 
tute. Order Document No. 7053 from ADI Auxiliary 
Publications Project, Photoduplication Service, Li- 
brary of Congress; Washington 25, D. C., remitting 
in advance $1.25 for microfilm or $1.25 for photo- 
Make check payable to: Chief, Photodupli 


cation Service, Library of Congress 


copies 


withdrawal, 
(5.7% )—compulsive 


(8.5% )—defensive 
Factor F 
formity. 

The first three, perhaps the first four, factors 
appear to be reasonably well defined. Intercor- 
relations between these oblique primary factors 
range from +.31 to —.54. Predictions of chro- 
nicity based upon specific factor scores for the 
first four factors generally were significantly bet- 
ter than chance or base rate. Item social desir- 
ability values indicate the potentially chronic pa- 
tient uniformly views himself undesirably. 

The factors obtained, particularly the first 
three, provide a meaningful description of the 
item response patterns likely to be produced by 
recently admitted neuropsychiatric patients des- 
tined to become chronic. Further, insofar as such 
inferences can be made from self-report ques- 
tionnaire data, a description of personality fea- 
tures predisposing one to neuropsychiatric chro- 
nicity has been obtained. Significantly, the MMPI 
items which have been found to discriminate be- 
tween long-term and short-term patients gener- 
ally are not “crazy” items, i.e 
tapping symptomatic severity of psychopathol- 
ogy. Further, the personality descriptions given 
by the factors do not fit neatly into the cur- 
rent diagnostic system. Increased understanding 
of those personality characteristics which are 
uniquely typical of the future chronic patient 
should provide new approaches to the prevention 
of neuropsychiatric chronicity. Although a num- 
ber of studies have been done on new treatment 
procedures for chronic patients, it seems that the 
source of greatest promise lies in early detection 
and the utilization of treatment procedures de- 
signed to prevent this o 


oversensitivity ; 
control, con- 


those directly 
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PERSONALITY VARIABLES ASSOCIATED WITH ADMITTED 
CRIMINAL BEHAVIOR? 


ARON WOLFE SIEGMAN 


University of Maryland School of Medicine 


A recent factor analytic study (Peterson, Quay, 
& Cameron 


1955) of two objective personality 


questionnaires of demonstrated ability in dis- 
criminating between institutionalized delinquents 
and nondelinquents produced the following clearly 
identifiable factors: (Factor I) psychopathic per- 
sonality traits, (Factor I1) emotional maladjust- 
ment, and (Factor III) 
ciologists, however, have pointe d out that the re- 
lation between actual criminal behavior and offi- 
cial adjudication for such behavior is far too at- 
tenuated for the latter to be a useful criterion of 
criminality. Some sociologists have suggested that 
anonymous admission of criminal behavior may 
be a more valid measure of criminality. The 
present study investigated the relation between 
anonymously admitted criminality and Factors I 
II, and III, and two other variables which have 


familial dissension. So- 


been assumed to be related with criminality: low 
n Achievement and religiosity 

A group of 54 female and 
it Bar-Ilan University 
admitted 


male students 
Israel, completed anony- 
(ACB) 
Quay and 
e (the latter 
and III) 
a nine- 
(RBS), and 


-rating scale 


} 
scaie 


mously an 
the Gough 


criminality 
(So) sc ile 
Peterson’s delinquency (De) scal 

two contain the items of Factors I, II 
Achievement (n Ach) 
point religious belief self-rating 


Socialization 


Edwards’ n scale 
scale 
a nine-point religious observance self 
(ROS) 

High ACB scale scorers obtained significantly 
higher Factor II (Mean = 7.94, SD 
3.24) than low ACB scale scorers (Mean 5.5 
SD = 3.24; #= 2.085, p- 5). Significant (at 
5 level or better) negative correlations (r) were 
»btained between the subjects’ ACB scale scores 


scores 


1An extended report of this study may be ob- 
tained without charge from Aron Wolfe Siegman 
Psychiatric Institute, University of Maryland School 
of Medicine; Baltimore 1, Maryland) or for a fee 
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and their So scale their RBS 
29) and their ROS scores (— .64). A 
positive correlation was obtained be- 
subjects’ ACB and n Ach scale scores 
which is not, however, in the predicted 
There were no significant correlations 
between the subjects’ ACB scale scores and their 
De scale, Factor I, and Factor III scores; nor 
was sex a significant variance in the 


sul 


scores ( 30), 
scores ( 

significant 
tween the 
(r 45). 
direction 


source of 
jects’ ACB scale scores. 

Psychopathy and family dissension, which were 
significant sources of variance in officially ad- 
judicated criminality not related to ad- 
mitted criminality. One possible explanation for 
this discrepancy is that there is no causal rela- 
tion between these two variables and criminality 
and that their relation to institutionalized delin- 
quency merely reflects the association of all three 
with Another 
possibility is that these two variables are causally 
related with criminality, but that the subjects of 
whom 
background 
these 
significant 


were 


conditions socioeconomic status. 


were of middle 
did not 
variables 


the present study, all of 


and upper socioeconomic 


vary sufficiently in relation to 
The 
between 
that 
is convicted or institutionalized, but not to crimi 
nality as such. The results concerning n Ach sug- 
gest the hypothesis that both very low and very 
high n Ach are related to criminality. In the pres- 
ent study, however, there were not enough sub- 
jects with sufficiently low n Ach to test both 
parts of the hypothesis. 
The significant relations 


failure to obtain a relationship 


sex and admitted criminality suggests 
sex may be related to whether or not 


one 


between the subjects’ 
n Ach, and emo- 
tional maladjustment scores are with 


the hypothesis that psychological variables are 


{CB scale scores and their So, 
consistent 


} 


significant sources of variance in criminality 
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FACTOR ANALYTIC SCALES OF COVERT AND OVERT 


HOSTILITY ? 


A. W. BENDIG 


University of Pittsburgh 


It is apparent that two oblique traits are meas- 
ured by the eight subscales of the Buss-Durkee 
Hostility Inventory (Bendig, 1961; Buss & Dur- 
kee, 1957) and that a pair of scores measuring 
these traits would be more diagnostic than a 
single total inventory score and also less re- 
dundant than eight subscale scores. The present 
research was undertaken with the aim of develop- 
ing at least preliminary forms of such scales. 

The 75-item hostility inventory was adminis- 
tered to two groups of college students enrolled 
in psychology courses. Group 1 consisted of 218 
subjects (163 men and 55 women) and Group 2 
of 151 subjects (55 men and 96 women). Using 
the responses of Group 1, the first 40 items were 
intercorrelated (phi), two centroid factors were 
extracted, and the factors rotated to oblique 
simple structure using the analytic Oblimax cri- 
terion. A similar procedure was followed for the 
last 35 items, again using the data from Group 1 
Eighteen items in the first analysis and 22 
items in the second showed at least one factor 
loading above .21 and these 40 items were se- 
lected for a third analysis, again using the Group 
1 data. In this third analysis 36 of the items 
showed large loadings on only one of the two 
factors. These 36 items were again intercorre- 

1An extended report of this study may be ob- 
tained without charge from A. W. Bendig (Depart- 
ment of Psychology, University of Pittsburgh; Pitts- 
burgh 13, Pa.) or for a fee from the American Docu- 
mentation Institute. Order Document No. 7054 from 
ADI Auxiliary Publications Project, Photoduplica- 
tion Service, Library of Congress; Washington 25, 
D. C., remitting in advance $1.25 for microfilm or 
$1.25 for photocopies. Make checks payable to: 
Chief, Photoduplication Service, Library of Congress 


lated, using the responses of Group 2, and the 
same factor analytic procedures applied. Of these 
36 items, 34 showed the same pattern of factor 
loadings as was found in the proceeding analyses. 
The correlations between the two oblique factors 
in the four analyses were .32, .32, .33, and .27. 

The content of the items loading each factor 
suggested that factors of “‘covert hostility” (CH) 
and “overt hostility’ (OH) had been identified. 
For the subjects in Group scores on the 22- 
item CH (drawn mainly from Buss and 
Durkee’s Irritability and Guilt subscales) had a 
reliability (Kuder-Richardson Formula 20) of 
.72, while the 14-item OH scale (primarily As- 
sault and Verbal Hostility items) had a reliability 
of .76. The scales were given to a new group of 
subjects (150 men and 15 The CH 
and OH scales showed rel .77 and .70 
for the men and .73 and .69 for the women. In- 
tercorrelations were .40 and .44 within each sex 
group. 

The item numbers 


scale 


women ). 


bilities of 


(Buss & Durkee, 1957, pp 


346-347) of the 34 valid items were: CH, 11, 16 


18, 19, 20, 21F, 28, 32, 36, 44, 45, 48, 52, 53, 54, 
56, 60, 66F, 69, and 71F: OH. 1F, 7, 15, 17F 
25, 27F, 39F, 43, 49, 51, 57, 63F, 65, and 70. It 
is suggested that these items can be used as at 
least preliminary measures of the two hostility 
factors. 
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ERRATUM 


A single error is duplicated in the article, “An Empirical Scale of Therapist Verbal 
Activity Level in the Initial Interview,” by Edmund S. Howe and Benjamin Pope, 
appearing in the December 1961 issue of this journal (J. consult. Psychol., 1961, 25, 


510-520). On page 512, line 7 
“intraclass r.”’ 


and on page 514, line 17, 


“interclass r” should read 











STUDY OF VALUES 
Third Edition 


Gordon W. Allport, Philip E. Vernon, and Gardner Lindzey 


Self-administering, easily-scored measure of six dominant interests in per- 
sonality: theoretical, economic, aesthetic, social, political, and religious. 
Designed primarily for use with college students or adults with equivalent 
education. 


Package of 33 le si bookle ls u ith manual, nel § 7.00. Spec meri Sé f. nel S. 48 


THE HENMON-NELSON TESTS OF MENTAL ABILITY 
College Level, Revised Edition 
Martin J. Nelson, Tom A. Lamke, and Paul C. Kelso 
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school, this omnibus-cycle group test of mental ability is especially valuable 
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Package of 35 lest booklets (Form A or B), net $3.60. 
“Me If-marking answer sheets (pkg. of 35) net $2.25. 
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America’s 


Psychologists 


A Survey of a Growing 
Profession 


By Kenneth E. Clark 


University of Minnesota 


A report of a study of American psychologists, 
sponsored by the APA Policy and Planning Board 
and supported by the National Science Foundation, 
which provides a clearer view of psychology in the 
mid-twentieth century by describing the people who 
are active in the field, and the nature of their a 

tivities. Some have been outstanding in research pro- 
ductivity. What are they like? How do they differ 
from their less productive colleagues? Are thers 
major differences among psychologists in. say, ex- 
perimental psychology and those in, say, industrial 
psychology? To answer such questions, Dr. Clark 
and his collaborators have studied the undergraduate 
education, family backgrounds, types of jobs held 
and attitudes and values of different groups of 
psychologists? How many persons in the United 
States are engaged in predominantly psychological 
work? Are recent recipients of the PhD similar to or 
different from those who received the degree 10 or 20 
years ago? Where are psychologists employed? W hat 
do they read? These are samples of the question 
that are discussed and on which substantial amounts 


of factual data are given in the pages of this report 


Price, $1.00 
* 


American 
Psychological Association 


1333 16th St., Washington 6. D.C, 
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Therapeutic Camping 


At Devereux Schools camping is a 
part of the total treatment program. 
Amidst sylvan beauty in Maine or 
Pennsylvania, or along the peaceful 
shores of the Pacific Coast, the 
Devereux camper, with the assist- 
ance of experienced staff, consoli- 
dates emotional and educational 
gains made previously during the 
year. 

It is camping with a purpose. It 
is a time of informality, of close 
constructive relationships between 
camper and counselor. Within each 
camp are groups which are homoge- 
neous with respect to age and gen- 
erai type of problem. Each camp has 
its own modern physical plant, 
trained staff, recreational and ac- 
tivity centers, and clinical resources 
to take care of the child’s needs at 
any hour of the day. 

Therapeutic camping is carefully 
planned and as purposeful as any 


other activity at Devereux Schools. 
Evaluative studies based upon a 
multi-disciplinary approach (educa- 
tion, psychiatry, psychology, medi- 
cine and social work) to a child’s 
problem guide in the formulation 
and implementation of a carefully 
designed therapeutic program for 
each child. i 


Applications for Devereux 
Schools’ Camps may be made at this 
time. Your inquiries are invited. 
For further irformation and litera- 
ture, East Coast residents may con- 
tact Mr. Charles J. Fowler, Director 
of Admissions, Devereux Schools, 
Devon, Pa. Those in the West Coast 
area, Mr. Keith A. Seaton, Regis- 
trar, Devereux Schools in Califor- 
nia, Santa Barbara, Calif. South- 
western residents, Mr. Richard 
Danko, Executive Administrator, 
Devereux Schools in Texas, Vic- 
toria, Texas. 


*Since the founding of Devereux Schools in 1912, a philosophy of Special Edu- 
cation and therapeutic techniques of rehabilitation and care have been developed 
for the sole reason of insuring to the greatest extent the future happiness and 
well-being of the retarded or emotionally disturbed child. Many of these tech- 
niques originated at Devereux Schools. This series is launched during the 
Schools’ 50th Anniversury in order that you may better understand the methods 
and goals of Devereux Schools. 


The Devereux Foundation 


A nonprofit organization Devon, Penna. « Santa Barbara, Calif. « Victoria, Texas 


Helena T. Devereux, Founder and Consultant +* Edward L. French, Ph.D., President and Director 
4. Clifford Scott, M.D. Anne Howe, MS.W. 
Director of Psychiotry Director of Socic! Work 


Kenneth E. Evons, 8.5. 
Director of Education 


ON cy, top sain Rite 


Jobn R. Kleisor, Ph.D. -. 
Director of Clinical Psychology 
Walter M. Ubler, M.D. é 
Director of Medice! Services 
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Need an individual test which quickly 
provides a stable and reliable estimate 
of intelligence in 2 or 3 minutes per 
form? Has three forms? Does not de- 
pend on verbal production or subjective 
scoring? Can be used with the severely 
physically handicapped (even paralyzed), 
if they can signal yes-no? Handles two- 
year-olds and superior adults within the 
same short series of items and the same 
format? Has been painstakingly stand- 
ardized? Builds rapport with intrinsi- 
cally interesting and simple procedures 
based on multiple drawings? 


Ammons and Ammons 


QUICK 
TEST 
(QT) 


Order ‘Tester’s Set including manual, 
card materials, keys, and 100 answer 
sheets ($8.00 complete). 


For further information, write to 


PSYCHOLOGICAL TEST 
SPECIALISTS 
Box 1441 
Missoula, Montana 








NEW TEST!! 





























